(19) 




Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 







(11) 



EP1 391 119 B1 



(12) 



EUROPEAN PATENT SPECIFICATION 



(45) Date of publication and mention 
of the grant of the patent: 
14.06.2006 Bulletin 2006/24 

(21) Application number: 02785955.2 

(22) Date of filing: 28.1 1 .2002 



(51) IntCI.: 

H04N 7/24< 200601 > H04N 5/76 1 2000 01 * 

G11B20/10< 200601 > 

(86) International application number: 
PCT/JP2002/012414 

(87) International publication number: 

WO 2003/047261 (05.06.2003 Gazette 2003/23) 



(54) A METHOD AND AN APPARATUS FOR DATA RECORDING 

VERFAHREN UND VORRICHTUNG ZUM DATENAUFZEICHNEN 
PROCEDE ET APPAREIL D'ENREGISTREMENT DE DONNEES 



m 



O) 
CO 



Q. 
LU 



(84) Designated Contracting States: 

AT BE BG CH CY CZ DE DK EE ES Fl FR GB GR 
IE IT LI LU MC NL PT SE SK TR 

(30) Priority: 30.11.2001 JP 2001367788 

(43) Date of publication of application: 
25.02.2004 Bulletin 2004/09 

(73) Proprietors: 

• MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. 
Kadoma-shi, Osaka 571-8501 (JP) 

• Koninklijke Philips Electronics N.V. 
5621 BA Eindhoven (NL) 

• SONY CORPORATION 
Tokyo 141-0001 (JP) 

(72) Inventors: 

• NAKAMURA, Kazuhiko 
Hirakata-shi, Osaka 573-0084 (JP) 

• YAHATA, Hiroshl 
Osaka 571-0058 (JP) 



• KAWASAKI, Kojiro 
Katano-shi, Osaka 576-0041 (JP) 

• VAN GESTEL, Wilhelmus, Jacobus 
NL-5656 AA Eindhoven (NL) 

• KELLY, Declan, Patrick 
NL-5656 AA Eindhoven (NL) 

• KATO, Motoki, 

c/o SONY CORPORATION 
Tokyo 141-0001 (JP) 

• NAKAMURA, Masanobu, 
c/o SONY CORPORATION 
Tokyo 141-0001 (JP) 

(74) Representative: Eisenfuhr, Speiser & Partner 
Patentanwalte Rechtsanwalte 
Postfach 10 60 78 
28060 Bremen (DE) 



(56) References cited: 
EP A- 0 843 482 
US-A1-2001 009 548 
US-B1-6 456 783 



EP-A- 0 899 964 
US-B1-6 236 663 



Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give 
notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in 
a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art. 
99(1) European Patent Convention). 



Printed by Jouve, 75001 PARIS (FR) 



t 



EP1 391 119 B1 



Description 

Technical Held 

5 [0001] The present invention relates to a readable, recordable data recording medium for recording multimedia data 
including moving picture (video) data, still picture data, audio data, and formatting data for data broadcasting. This 
invention also relates to a system and method for recording data to this data recording medium. 

Background Art 

10 

[0002] Where 4.7 GB was until recently the maximum storage capacity for rewritable optical discs, phase-change 
DVD- RAM media with a storage capacity of tens of gigabytes are now available. DVD- RAM media are already used as 
a storage medium in the computer industry, and are expected to soon be used as a recording and playback medium in 
the audio-video (AV) field as a result of the development of economical encoders and decoders implementing the M PEG- 
'S 1 and MPEG-2 digital AV data coding standards. 

[0003] Digital broadcasting has already started in Japan, making it possible to multiplex video, audio, and data for 
multiple programs to an MPEG transport stream (MPEG-TS below). Digital broadcast recorders using hard disk drives 
or DVD drives are also available. 

[0004] These next-generation digital broadcast recorders often record the broadcast content as it was broadcast 
20 without converting the MPEG-TS. So that the recorder does not need to be able to internally process both the MPEG- 
TS and MPEG program stream (MPEG-PS below), these recorders are expected to encode even external analog AV 
content from line input terminals (i.e., user content) to the MPEG-TS for recording. 

[0005] The current DVD theoretical standards (such as DVD-Video, DVD-Audio, DVD Video Recording, and DVD 
Stream Recording standards) use the MPEG-PS for AV stream recording. This means that to convert content recorded 
25 using the MPEG-TS, such as in the above digital broadcast recorder, to the DVD-Video format, for example, the MPEG- 
TS must be converted to an MPEG-PS. 

[0006] Converting an M PEG-TS multiplexed content stream to an M P EG-PS, however, requires complex computations 
for decoder buffer management. The conversion process therefore takes longer, requires re-encoding the elementary 
stream, may degrade the image and sound quality, and is thus generally difficult to accomplish. 
30 [0007] US 6,236,663 discloses an information reproduction apparatus and a corresponding information reproduction 
method for performing special reproducing operations including fast forward and fast reverse operations on audio-video 
material. Information having a plurality of clips can be read and reproduced. 

[0008] US 2001/009548 A1 discloses a method and apparatus for converting an input data stream having an MPEG- 
2 Transport Stream (S format) into an output data stream having an MPEG-2 Program Stream (PS) format. 
35 [0009] EP 0 899 964 A2 discloses a data transmitting device, a data receiving device and a data recording device. 
The data transmitting device is connected to a prescribed network. In particular a PS to TS converter is disclosed which 
requires no re-encoding. 

Disclosure of Invention 

40 

[0010] The present invention is therefore directed to solving these problems, and an object of this invention is to 
provide an apparatus and a method for recording a recording apparatus and method for recording data to an MPEG 
transport stream whereby content recorded in an MPEG-TS format can be converted quickly and simply to an MPEG- 
PS format. 

45 [0011] A recording apparatus according to the invention as defined in claim 1 is provided for multiplexing video infor- 
mation and audio information to record the information to a recording medium as a first stream in a constrained format 
enabling conversion from the first stream to a second stream, the first stream having a structure for storing data segmented 
in first blocks, the second stream having a structure for storing data segmented in second blocks, the maximum data 
size of the first and second blocks being different. 

so [0012] In the recording apparatus, transfer rate conditions allowed for audio data and video data may be different in 
the first stream and second stream. The control section may control the encoding section such that the first stream meets 
these transfer rate conditions in both the encoded first stream and anticipated second stream. 
[0013] According to these transfer rate conditions, the maximum transfer rate allowed for a first block storing video 
data in the first stream may be greater than or equal to the maximum transfer rate allowed for a first block storing audio 

55 data. The maximum transfer rate allowed for a second block storing video data in the second stream may be equal to 
the maximum transfer rate allowed for a second block storing audio data. 

[0014] A recording method according to the present invention as defined in claim 4 is provided for multiplexing video 
information and audio information to record the information to a recording medium as a first stream in a constrained 



2 



T 



EP1 391 119 B1 

format enabling conversion from a first stream to a second stream, the first stream having a structure for storing data 
segmented in first blocks, the second stream having a structure for storing data segmented in second blocks, the 
maximum data size of the first and second blocks being different. 

[001 5] A program according to the present invention as defined in claim 5 enables a computer to execute the recording 
5 method of the invention. 

< Effects of the lnvention> 

[001 6] A data recording and reproducing apparatus according to the present invention efficiently encodes and decodes 
10 externally input AV data while maintaining decoder compatibility when self-encoding the AV data to an MPEG transport 
stream. The MPEG-TS recorded to the data recording medium is multiplexed in block units of 2 KB or less to improve 
compatibility and convertibility to an MPEG-PS. Because the MPEG-TS is multiplexed to allow for easy conversion to 
an MPEG-PS, the MPEG-TS can be converted to an MPEG-PS quite easily with sequential processing of transport 
stream packets without considering .buffer management. Furthermore, by defining the relationship between the.TS 
15 packettransfertiming information (ATS) and PS pack transfer timing information (SCR) and the correlation to the header, 
the MPEG-TS can be dependably converted to an MPEG-PS conforming to a specified decoder reference model. 

Brief Description of Drawings 

20 [0017] 

Fig. 1 is a schematic diagram of an exemplary interface between a DVD recording apparatus and other components 
used in conjunction therewith; 

Fig. 2 is a block diagram of the drive of a DVD recorder; 
25 Fig, 3A isa graph illustrating the correlation between data accumulation in a track buffer, and Fig. 3B is a contiguous 

area on disc; 

Fig. 4 is a block diagram of a DVD recorder having a semiconductor memory card and hard disk drive; 
Fig. 5A shows physical structure of a typical disc, and Fig. 5B shows a format of a typical disc; 
Figs. 6A and 6B show logical data spaces of the disc; 
30 Fig. 7A shows the disc directory, Fig. 7B shows the file structure; 

Fig. 8 shows the structure of a video object; 
Fig. 9 shows the MPEG system stream; 

Figs. 10A to 10C show the MPEG transport stream (MPEG-TS); 

Figs. 11 A to 11C show the MPEG program stream (MPEG_PS); 
35 Figs. 1 2A to 1 2D show a TS packet; 

Figs. 13A, 13B, 13C1 and 13C2show examples of PAT table and PMAP table; 

Figs. 14A to 14C show the arrangement of video objects on disc; 

Figs. 15A and 15B show the data structure of video management information; 

Figs. 1 6A and 1 6B show the data structure of video management information; 
*o Fig. 1 7 shows the relationship between an object, object information, and PGC information in the video management 

information; 

Fig. 18 is a block diagram showing the functional configuration of a playback (reproducing) apparatus; 
Fig. 19 is a block diagram showing the functional configuration of a recording apparatus; 

Fig. 20 describes the correlation between an MPEG-TS encoded for easy conversion to an MPEG-PS, and the 
45 MPEG-PS after conversion; 

Fig. 21 is a block diagram of the encoder of a data recording apparatus according to the present invention; 

Fig. 22 shows differences in processes for converting from a self-encoded MPEG-TS to DVD formats due to differ- 
ences in system encoding; 

Fig. 23 shows the data structure of Tip packet; 
50 Fig. 24 shows the data structure of DataJD; 

Fig. 25 shows the data structure of display_and_copyjnfo; 

Fig. 26 shows the data structure of encodejnfo; 

Fig. 27 shows the data structure of PESJnfo; 

Fig. 28 shows the data structure of Makers Private Data; 
55 Fig. 29A describes PID of Tip packet, and Fig. 29B describes streamjype of Tip packet; 

Fig. 30 shows the field values of the PES packet header in a Constrained SESF stream; 

Fig. 31 shows PES_extension_flag and PES_header_data_length in a Constrained SESF stream; 

Fig. 32 shows an example of an MPEG-TS self-encoded such that it does not conform to the T_STD model; 
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Figs. 33A and 33B show an example of an MPEG_PS converted from an MPEG-TS such that the MPEG_PS does 
not conform to the P_STD model; 
Fig. 34 shows SCR calculation; 

Fig. 35 shows the elementary stream attributes of a Constrained SESF when encode_condition = "1 1b M ; 

5 Fig. 36 shows the elementary stream attributes of a Constrained SESF when encode condition = "01 b"; 

Fig. 37 shows the standard stream structure in DVD Video; 

Fig. 38 shows part of the data structure of the pack header of a pack in MPEG-2 program stream; 

Fig. 39 shows part of the data structure of the packet header in an MPEG-2 program stream; 

Figs. 40A and 40B show conversion from a Constrained SESF to an MPEG_PS for a video pack; 
™ Figs. 41 A and 41 B show conversion from a Constrained SESF to an MPEG_PS for an audio pack; 

Fig. 42 is a table of audio bit rates allowed by the Constrained SESF, and the maximum payload length stored to 

one audio PES packet for AC-3 and MPEG-1 Audio at the corresponding bit rates; 

Fig. 43 is a flow chart of the overall TS2PS conversion process; 

Fig. 44 is a flowchart of the initialization process in the TS2PS conversion process; 
is Fig. 45 is a flow chart of the capsule unit process in the TS2PS conversion process; 

Fig. 46 is a flow chart of the pack unit process; 

Fig. 47 is a flow chart of the SCR calculation process; 

Fig. 48 is a flow chart of the pack header process; 

Fig. 49 is a flow chart of the packet header process; 
20 Fig. 50 is a flow chart of the stream ID process; 

Fig. 51 is a flow chart of the start-of-PES packet process; 

Fig. 52 is a flow chart of the non-start-of-PES packet process; 

Fig. 53 is a flow chart of the payload process; 

Fig. 54 is a flow chart of the padding packet process; 
25 Fig. 55 shows the Constrained SESF stream format; 

Fig. 56 shows the data structure of PES packet on an MPEG standard; 

Fig. 57A describes conversion from an unconstrained MPEG-TS to an MPEG-PS, and Fig. 57B describes conversion 
from a constrained MPEG-TS to an MPEG-PS; 

Fig. 58A describes buffer management of the MPEG-TS and anticipated MPEG-PS when the bit rate is the same 
30 in the MPEG-TS being converted and the resulting MPEG-PS (when buffer underflow occurs); 

Fig. 58B describes buffer management of the MPEG-TS and anticipated MPEG-PS when the bit rate is the same 
in the MPEG-TS being converted and the resulting MPEG-PS (when buffer underflow does not occur); 
Fig. 59A describes buffer management of the MPEG-TS and anticipated MPEG-PS when the bit rate of the MPEG- 
TS being converted is higher than the bit rate of the resulting MPEG-PS (when a buffer underflow occurs only with 
35 the MPEG-PS); 

Fig. 59B describes buffer management of the MPEG-TS and anticipated MPEG-PS when the bit rate of the MPEG- 
TS being converted is higher than the bit rate of the resulting MPEG-PS (when a buffer underflow does not occur); 
Fig. 60A describes determining the time stamp information (SCR) set in the packs of the converted MPEG-PS (when 
the MPEG-TS and MPEG-PS with the same bit rate); 

Fig. 60B describes determining the time stamp information (SCR) set in the packs of the converted MPEG-PS (when 
the MPEG-TS transfer rate is higher than the MPEG-PS transfer rate); and 

Fig. 61 shows the correlation between the relative transfer time ATS added to each TS packet, and the transfer time 
calculated_PCR[n] of the first TS packet in the multiplexing unit. 

45 Best Mode for Carrying Out the Invention 

[0018] A DVD disc, DVD recorder, and DVD player are described in the following sequence below with reference to 
the accompanying figures as specific embodiments of a data recording medium, recording system, and playback system 
according to the present invention. 
50 [0019] The essential points of the present invention are described in section 8, summary of the invention, and section 
9, detailed description of the embodiments. It will be noted, however, that all of the following are embodiments of the 
invention though they vary in their relationship to the invention. 

1 . Summary of a DVD recorder system 
55 2. Functional summary of a DVD recorder 

3. Summary of a DVD disc 

4. Summary of reproduced AV data 

5. Summary of AV data management information and playback control 
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6. Basic operation of the playback function 

7. Basic operation of the recording function 

8. Summary of the invention 

9. Detailed description of the embodiments 

[0020] Note that for simplicity TS2PS conversion" as used below means converting the MPEG transport stream 
(MPEG-TS) to an MPEG program stream (MPEG-PS), and "DVD format" refers both the format of the DVD-Video 
standard and the format of the DVD-Video Recording standard, both of which are MPEG-PS formats. 

1 . Summary of a DVD recorder system 

[0021] Fig. 1 is a schematic diagram used to describe a DVD recorder and the interface between a DVD recorder and 
other equipment. 

[0022] As shown In Fig. 1, a DVD optical disc is loaded into the DVD recorder for recording and reproducing video 
data. The DVD recorder is typically operated with a remote control. 

[0023] Video data can be input to the DVD recorder using analog signals, such as from analog broadcasts, or digital 
signals, such as from digital broadcasts. Analog broadcasts are generally received by the receiver built in to a television, 
for example, demodulated, and input the DVD recorder as an NTSC or other type of analog video signal. Digital broadcasts 
are usually received and demodulated to a digital signal by a set-top box (STB) receiver and input to the DVD recorder 
for recording. 

[0024] Video data recorded to a DVD is similarly reproduced and externally output by the DVD recorder. As with the 
input, the video can be output as an analog signal or digital signal. Analog signal output can be output directly to the 
television while digital signal output is passed through the STB for conversion to an analog signal before input to the 
television for viewing. 

[0025] In addition to DVD recorders, DVD camcorders and personal computers can also be used to record and play 
back video data to and from DVDs. DVDs containing video data recorded by a device other than a DVD recorder can 
also be loaded into the DVD recorder for playback. 

[0026] Audio data is also normally recorded with the video data in both analog and digital broadcasts, and this audio 
data can be likewise recorded and reproduced by the DVD recorder. 

[0027] Furthermore, the video data is generally moving picture data (such as a movie), but may also be or include still 
pictures. These still pictures can be recorded using the still imaging function of a DVD camcorder, for example. 
[0028] Various digital interfaces can be used to connect the STB and DVD recorder, including IEEE 1394, ATAPI, 
and SCSI. 

[0029] It will also be noted that the NTSC composite video signal is noted above for signals passed between the DVD 
recorder and television, but a component signal in which the luminance signal and color difference signal are sent 
separately could be used. 

[0030] Digital interfaces such as DVI are also being developed to replace the analog interface used for video transfers 
between the AV equipment and television, and the DVD recorder and television are expected to be connected via a 
digital interface before long. 

2. Functional summary of a DVD recorder 

[0031] Fig. 2 is a block diagram showing the functions of a DVD recorder. A typical DVD drive has an optical pickup 
1 01 for reading data from a DVD-RAM disc 1 00, an ECC (error correction code) processor 1 02, track buffer 1 03, switch 
1 04 for changing track buffer 1 03 input and output, an encoder 1 05, and a decoder 1 06. 

[0032] As shown in the figure data is recorded to a DVD-RAM disc 1 00 in sector units as the smallest recording unit. 
One sector contains 2 KB of data. Sectors are then grouped into ECC blocks with 32 sectors/ECC block. The ECC 
processor 1 02 applies error correction to ECC block units. 

[0033] The DVD recorder may also use semiconductor memory cards or hard disk drives as data storage media in 
addition to DVD discs. Fig. 4 is a block diagram of a DVD recorder equipped with a semiconductor memory card and 
hard disk drive in addition to a DVD disc drive. 

[0034] It should be noted that one sector could be 512 bytes, 8 KB, or other unit. Each ECC block could also contain 
only 1 sector, or 1 6 sectors, 64 sectors, or other number of sectors. As the amount of data that can be stored to the disc 
increases, both the sector size and number of sectors in each ECC block are expected to increase. 
[0035] The track buffer 1 03 records AV data at a variable bit rate (VBR) so that AV data can be recorded more efficiently 
to the DVD-RAM disc 100. The DVD-RAM disc 100 read/write rate (Va) is a fixed rate, but the bit rate (Vb) of the AV 
data varies according to the complexity of the content (images in the case of video). The track buffer 103 is therefore 
used as a buffer to absorb this difference between the read/write rate (Va) and AV data bit rate (Vb). 
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[0036] By using this track buffer 1 03 even more effectively the AV data can be recorded non-contiguously to the disc 
100 as described below with reference to Figs. 3A and 3B. 

[0037] Fig. 3A shows the address space of an optical disc. When AV data is recorded to contiguous area [a1 , a2] and 
contiguous area [a3, a4] that is non-contiguous to [a1 , a2] as shown in Fig. 3A, continuous playback of the AV data while 
seeking from a2 to a3 can be sustained by supplying data accumulated in the track buffer 103 to the decoder 106. This 
is shown in Fig. 3B. 

[0038] AV data read starting from address a1 is input to the track buffer 103 from time t1 while data output from the 
track buffer 103 also starts. Data thus accumulates in the track buffer 103 at the rate (Va-Vb), that is, the difference 
between the track buffer input rate Va and the track buffer output rate Vb. This continues to the end of contiguous area 
[a1 , a2] at location a2, that is, time t2. If the amount of data accumulated in the track buffer 1 03 during this time is B(t2), 
data can be supplied to the decoder 1 06 during the period from time t2 to time t3 at which reading from location a3 starts 
by consuming the data B(t2) stored in track buffer 103. 

[0039] In other words, if at least a certain minimum amount of data ([a1 , a2]) is stored before the seek operation, the 
AV data can be continuously supplied to the decoder when a seek occurs. 

[0040] The size of the contiguous area enabling AV data to be continuously supplied to the decoder when converted 
to the number of ECC blocks N_ecc can be determined from the following equation: 

N_ecc = Vb*Tj/((N_sec*8*S_size)*(1-VbA/a)) 

where N_sec is the number of sectors per ECC block, S_size is the sector size, and Tj is the seek performance (maximum 
seek time). 

[0041] There could also be a defective sector in the contiguous area. The size of the contiguous area considering this 
factor can be determined from the following equation: 

N_ecc = dN„ecc+Vb*(Tj+Ts)/((N_sec*8 # S_size)*(1-VbA/a)) 

where dN_ecc is the tolerated defective sector size, and Ts is the time required to skip a defective sector in the contiguous 
area. The resulting size is also expressed as the number of ECC blocks. 

[0042] Reading, that is reproducing, data from a DVD- RAM disc is used byway of example above and it will be obvious 
that the same concept applies to writing, that is, recording, data to a DVD-RAM disc. 

[0043] It will thus be apparent that non-contiguously recorded AV data can be continuously reproduced from and 
recorded to a DVD-RAM disc insofar as a certain minimum amount of data is contiguously recorded to the disc. This 
area is referred to as a contiguous disc area (CD A) with respect to DVD media. 

3. Summary of a DVD disc 

[0044] Figs. 5A and 5B show a plan view and physical structure of a DVD-RAM disc as a type of recordable optical 
disc. It should be noted that a DVD-RAM disc is usually loaded to a DVD recorder in a disc cartridge to protect the 
recording surface of the disc. However, if the recording surface is protected by some other means or a certain amount 
of surface damage is tolerable, the disc could be loaded directly to the DVD recorder without using a disc cartridge. 
[0045] DVD-RAM media are phase change recording media. Data recorded to the disc is managed in sector units, 
and is recorded with an address enabling access. As noted above, 32 sectors are grouped as one error correction unit 
to which an error correction code is added. This unit is called an ECC block. 

[0046] Fig. 5A is a plan view showing the recording area of a DVD-RAM disc as exemplary of a recordable optical 
disc. The DVD-RAM disc has a lead-in area in the center at the inside circumference, a lead-out area around the outside 
circumference, and a data area between the lead-in area and lead-out area. Reference signals for servo stabilization 
when the optical pickup accesses the disc, and media identification signals enabling the type of optical disc to be identified, 
are recorded in the lead-in area. The same reference signals and media ID signals are also recorded to the lead-out 
area. The data area is segmented into sectors (each storing 2048 bytes) as the smallest access unit. 
[0047] The data area of a DVD-RAM disc is also segmented into multiple zones so that a rotational control method 
called 2-CLV (Zone Constant Linear Velocity) can be used for recording and playback. 

[0048] Fig. 5A shows plural zones formed concentrically on the DVD-RAM disc. In this example the DVD-RAM disc 
is divided into 24 zones, labelled zone 0 to zone 23. The rotational angular velocity of the DVD-RAM is set differently in 
each zone such that it increases in proximity to the inside circumference and is constant while the optical pickup accesses 
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data in the same zone. This increases the recording density of the DVD-RAM and enables easier rotational control 
during recording and playback. 

[0049] Fig. 5B shows the lead-in area, lead-out area, and zones 0 to 23 concentrically arranged in Fig. 5A when viewed 
in a line through the disc radius. 

5 [0050] The lead-in area and lead-out area each include a defect management area (DMA). The defect management 
area is for recording location information indicating the location of a sector containing a defect, and substitute sector 
location information indicating in which substitute area the sector substituted for the defective sector is located. 
[0051] Each zone includes a user area in the center of the zone, and a substitute area and an unused area at the 
boundary of the zone. The user area is the area that can be used by the file system as a recording area. The substitute 

w area is the area substituted for a defective sector in the zone. The unused area is an area not used for data recording, 
and is approximately two tracks wide. The sector address is recorded to the same position in adjacent tracks within each 
zone, but with Z-CLV the sector address is recorded to a different position in tracks adjacent to the zone boundary. This 
unused area is therefore provided to prevent sector address detection errors in tracks adjacent to the zone boundary. 
[0052] There are, therefore, sectors not used for data recording at the zone boundaries. A logical sector number (LSN) 

15 is therefore assigned to each physical sector in the user area of a DVD-RAM disc to continuously identify only those 
sectors used for data recording sequentially from the inside circumference. 

[0053] Figs. 6A and 6B show the logical data space of a DVD-RAM disc comprising logical sectors. The logical data 
space is called the "volume space" and is used to record user data. 

[0054] Data recorded in the volume space is managed with afile system. More specifically, volume structure information 
20 for managing a group of sectors storing data as a "file" and a group of files as a "directory" is recorded to the beginning 

and end of the volume area. This embodiment of the invention uses the UDF file system as defined in ISO 1 3346. 

[0055] The above-noted group of sectors are not necessarily located contiguously within the volume space, and can 

be split into separate parts. Of the sectors constituting each file, the file system therefore manages each group of 

contiguous sectors in the volume space as an extent, and manages each file as a set of related extents. 
25 [0056] Figs. 7A and 7B show the structure of a directory and file recorded to DVD-RAM. Below the root directory is 

the VIDEO_RT directory, and below VIDEO_RT are the various object files containing the playback data and a VIDEO 

Manager file containing management information such as the playback sequence and various attributes. 

[0057] Objects are data structures conforming to MPEG standards, and include PS.VOB, TS1_VOB, TS2.VOB, AOB, 

POB, and MNF (Manufacturer's Private Data). 
30 [0058] PS_VOB, AOB, and POB are MPEG program streams (PS), and TS1_VOB andTS2_VOB are MPEG transport 

streams (TS). The program stream has a data structure designed for storing AV data to package media. The transport 

stream has a data structure intended for communications media. 

[0059] PS_VOB, TS1_VOB andTS2_VOB are objects of primarily video data but containing both video data and audio 
data. In principle, TS1_VOB objects are encoded by the DVD recorder with an explicitly managed internal picture 
35 structure. TS2_VOB objects are encoded externally to the DVD recorder, and part of the internal picture structure and 
data structure is unknown. 

[0060] Typically, TS1 _VOB is an object obtained by encoding externally input analog video signal by the DVD recorder 
to the transport stream, and TS2_VOB is an object obtainded by recording externally input digital video signal directly 
to the disc without further encoding by the DVD recorder. 
40 [0061 ] AOB and POB are MPEG program streams. AOB objects contain primarily audio data, and POB objects contain 
primarily still pictures. 

[0062] The MNF is used to store information specific to a particular manufacturer. 

[0063] "Primarily video data" and "primarily audio data" above indicate that a high bit rate is allocated. VOB are used 
in moving picture and similar applications, and AOB are used in music applications. 

45 

4. Summary of reproduced AV data 

[0064] Fig. 8 shows the structure of MPEG data recorded as AV objects to a DVD. 

[0065] As shown in Fig. 8, the video stream and audio stream are segmented and multiplexed. The MPEG standard 
so refers to the multiplexed streams as the system stream. In the case of DVD, a system stream in which DVD-specific 
information is set is called a VOB (Video OBject). The segmentation unit is called "pack" or "packet", and is approximately 
2 KB in size. 

[0066] The video stream Is encoded according to the MPEG standard, while it is compressed with variable bit rate 
such that the bit rate is increased in complex images such as images containing much movement. The pictures in an 
55 MPEG stream are encoded as l-pictures, P-pictures, or B-pictures. l-pictures are spatially compressed and complete 
within each frame. P-pictures and B-pictures are temporally compressed using inter-frame correlations. A series of 
pictures including at least one l-picture is referred to as a Group of Pictures (GOP) in MPEG. GOP is the access point 
for fast play and other special play modes, which are made possible by the presence of at least one intra-frame compressed 
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l-picture. 

[0067] In addition to using MPEG audio, the audio stream of a DVD can be encoded using AC-3, LPCM, or other 
encoding technique. 

[0068] As also shown in Fig. 8 the Video Object Unit (VOBU) is the data unit multiplexing the video data of a GOP 
with the associated audio data. VOBU may include information to manage a section of the moving picture achieved by 
itself as header information. 

[0069] A program stream (PS) and transport stream (TS) are included in the system stream described with reference 
to Fig. 8. As noted above, the program stream has a data structure intended for package media and the transport stream 
data structure is intended for communications media. 

[0070] Fig. 9 shows the concept of the program stream and transport stream data structures. 
[0071] The program stream contains fixed length packs that are the smallest unit for data transfer and multiplexing. 
Each pack contains one or more packets. Both packs and packets have a header part and a data part. The data part is 
referred to as the payload in MPEG. For compatibility with the sector size, the fixed length of a pack in DVD is 2 KB. A 
pack can contain multiple packets, but because packs storing DVD video and audio contain only one packet, 1 pack 
equals 1 packet except in special cases. 

[0072] The data transfer and multiplexing unit of the transport stream contains fixed length TS packets. TS packet 
size is 188 bytes for compatibility with ATM transmissions, a communications standard. One or more TS packets form 
a PES packet. 

[0073] The PES packet concept is common to both the program stream and transport stream, and the data structure 
is the same. Packets stored in program stream packs directly form PES packets, and a group of one or more transport 
stream TS packets form a PES packet. 

[0074] The PES packet is the smallest encoding unit and stores video data and audio data with common encoding. 
More specifically, video data and audio data encoded with different coding methods are not present in the same PES 
packet. However, if the coding method is the same, it is not necessary to ensure the picture boundaries and audio frame 
boundaries. As shown in Fig. 9 plural frames may be stored to one PES packet. 

[0075] Figs. 1 0A to 1 0C and Figs. 1 1 A and 1 1 C show the data structures of the transport stream and program stream. 
[0076] As shown in Figs. 1 0A to 1 0C and Figs. 1 2A to 1 2D, each TS packet contains a TS packet header, adaptation 
field, and payload. The TS packet header stores a Packet Identifier (PID) whereby the video, audio, or other stream to 
which the TS packet belongs can be identified. 

[0077] The Program Clock Reference (PCR) is stored to the adaptation field. The PCR is the reference value for the 
system time clock (STC) of the device decoding the stream. The device typically demultiplexes the system stream based 
on the PCR timing, and then reassembles the video stream and other streams. 

[0078] The Decoding Time Stamp (DTS) and Presentation Time Stamp (PTS) are stored to the PES header. The DTS 
denotes the decoding timing of the picture or audio frame stored to the PES packet, and the PTS denotes the presentation 
timing of the video or audio output. 

[0079] It should be noted that the PTS and DTS need not be written to every PES packet header. Decoding and output 
are possible insofar as the PTS and DTS are written to the header of the PES packet where the first data of the l-picture 
is stored. 

[0080] The TS packet structure is shown in detail in Figs. 12A to 12D. 

[0081] As shown in Figs. 12B to 12D, the adaptation field stores the PCR and a random access presentation flag. 
This flag indicates whether data that is at the beginning of the video or audio frame and can be used as an access point 
is stored in the corresponding payload. In addition to the above-noted PID, the TS packet header also stores a unit start 
indication flag indicating the beginning of a PES packet, and adaptation field control data indicating whether an adaptation 
field follows. 

[0082] Fig. 1 1 A to 1 1C show the structure of packs in the program stream. A pack contains the SCR in the pack 
header and a streamjd in the packet header of packets stored in the pack. The SCR is effectively identical to the 
transport stream PCR, and the streamjd to the PID. The PES packet data structure is also the same as in the transport 
stream, and the PTS and DTS are stored in the PES header. 

[0083] One major difference between the program stream and transport stream is that the transport stream allows for 
multiple programs. That is, in terms of program units, the program stream can carry only one program but the transport 
stream can simultaneously transmit multiple programs. This means that the playback device must be able to identify the 
video streams and audio streams constituting each program carried in the transport stream. 

[0084] Figs. 13A and 13B show the PAT table and PMAP table used to transmit structure information for the audio 
stream and video stream of each program. As shown in Figs. 13A and 13B, the PMAP table stores information relating 
to the combination of video and audio streams used in each program, and the PAT table stores information correlating 
programs and PMAP tables. The playback device can therefore reference the PAT table and PMAP table to detect the 
video and audio streams for the program to be output. 

[0085] How the program stream packs and transport stream TS packets described above are arranged on the disc is 
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described next with reference to Figs. 14A to 14C. 

[0086] As shown in Fig. 1 4A there are 32 sectors In an ECC block. 

[0087] As shown in Fig. 14B, the packs (PS Packs) forming a video object (PS_V0B) of a program stream type are 
located at the sector boundaries. This is because the pack size and sector size are both 2 KB. 
5 [0088] Video objects (TS1_VOB, TS2_VOB) in the transport stream format, however, is recorded to have a 1 88 byte 
packet paired with a 4 byte Arrival Time Stamp (ATS) denoting the decoder input time. When recording an externally 
encoded stream, the ATS is generated and added by the DVD recorder, and indicates the timing at which the packet 
was received by the DVD recorder from an external source. 

w 5. Summary of AV data management information and playback control 

[0089] Figs. 15A and 15B and Figs. 16A and 16B show the data structure of the video management information file 
(Video Manager) shown in Fig. 7 A and 7B. 

[0090] The video management information includes object information describing such management information as 
15 where objects are recorded on disc, and playback control information describing the playback sequence of the objects. 
[0091] Fig. 15 shows an example in which the objects recorded to the disc include PS_VOB#1 - PS_VOB#n, 
TS1_VOB#1 - TS1_VOB#n, and TS2_VOB#1 - TS2_VOB#n. 

[0092] As shown in Figs. 15A and 15B, a PS_VOB information table, TS1_VOB information table, and TS2_VOB 
information table are separately recorded according to the object types. Each of these tables stores VOB information 
20 for each object. 

[0093] The VOB information includes general information about the corresponding object, object attribute data, an 
access map for converting the object playback time to a disc address value, and management information for the access 
map. The general information includes identification information for the corresponding object and object recording time. 
The attributes include video stream attributes (V_ATR) such as the video stream coding mode, the number of audio 

25 streams (AST_Ns), and audio stream attributes (A__ATR) such as the audio stream coding mode. 

[0094] There are two reasons why an access map is required. The first is so that program chain information (playback 
path information to define a playback path) avoids directly referencing object recording positions based on a sector 
address value, for example, and instead can indirectly reference object locations based on the object playback time. 
Object recording positions can change with RAM media as a result of editing the object, for example. This increases the 

30 amount of program chain information that must be updated if the program chain information references object recording 
positions directly based on the sector address. If the objects are referenced indirectly based on the playback time, 
however, it is not necessary to update the program chain information and only the access map needs to be updated. 
[0095] The second reason is that the audio stream typically has two reference bases, the time base and data (bit 
stream) base, but the correlation therebetween is not perfect. 

35 [0096] For example, using a variable bit rate (a method of changing the bit rate according to the complexity of the 
image) is becoming the norm with MPEG-2 Video, an international standard for video stream encoding. In this case 
there is no proportional relationship between the amount of data from the stream start and playback time, and random 
access based on the time base is therefore not possible. An access map is used to resolve this problem by converting 
between the time base and data (bit stream) base. 

[0097] As shown in Fig. 1 5A, the playback control information includes a user-defined program chain information table, 
original program chain information table, and title search pointer. 

[0098] As shown in Fig. 16A there are two types of program chain information: originally defined program chain 
information generated automatically by the DVD recorder to describe all objects recorded during object recording, and 
user-defined program chain information enabling a user to freely define a particular playback sequence. The program 

« chain information is uniformly referred to as PGC information on a DVD, the user-defined program chain information is 
referred to as the U_PGC information, and the original program chain information as the 0_PGC information. The U_ 
PGC information and 0_PGC information are tables listing the cell information describing the cells in the object playback 
period. The object playback period indicated by the 0_PGC information is called an original cell (0_CELL), and the 
object playback period indicated by the U_PGC information is called a user cell (U_CELL). 

50 [0099] A cell indicates the object playback period using the object playback start time and playback end time; the 
playback start and end times are converted by the access map described above to the actual location where the object 
is recorded on disc. 

[0100] As shown in Fig. 1 6B a cell group indicated by the PGC information defines a continuous playback sequence 
reproduced sequentially according to the order of entries in the table. 
55 [0101] Fig. 1 7 shows a specific relationship between objects, cells, PGC, and access map. 

[0102] As shown in Fig. 17 the original PGC information 50 contains at least one cell information 60, 61, 62, 63. 
[0103] Each cell information 60, 61 defines the object to be reproduced as well as the object type, and object 
playback period. The order of the cell Information in the PGC information 50 defines the playback sequence of the objects 
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defined by each cell when the objects are reproduced. 

[0104] Each cell information (cell information 60, for example) Includes a Type 60a indicating the type of specific 
object, an Object ID 60b identifying a particular object, and a start presentation time Start_PTM 60c and end presentation 
time End_PTM 60d in the object on the time base. 
5 [0105] During data playback, the cell information 60 is sequentially read from the PGC information 50, and the objects 
specified by each cell are reproduced for the playback period defined by the cell. 

[01 06] The access map 80c converts the start and end time information contained in the cell information to the object 
address on disc. 

[01 07] This access map is the map information described above and is generated and recorded when the objects are 
10 recorded. The picture structure of the object data must be analyzed in order to generate the map. More specifically, it 
is necessary to detect the l-picture location shown in Fig. 9, and detect the PTS and other time stamp data, that is, the 
l-picture playback time shown in Fig. 10 and Fig. 1 1 . 

[0108] Problems occurring when generating the PS_VOB, TS1_VOB, and TS2_VOB map information are described 
next. 

15 [01 09] As described with reference to Fig. 1 , the PS_VOB and TS1_VOB are primarily generated by the DVD recorder 
encoding a received analog broadcast to an MPEG stream. The l-picture and time stamp data are therefore generated 
by the DVD recorder, the internal data structure of the stream is known to the DVD recorder, and the map information 
can be generated with no problem. 

[0110] As also described with reference to Fig. 1, the TS2_VOB is a received digital broadcast recorded directly to 
20 the disc by the DVD recorder with no intermediate encoding. Because the recorder thus does not generate the time 
stamp information and determine the l-picture locations as it does when recording a PS_VOB, the DVD recorder does 
not know the internal data structure of the stream and must therefore detect this information from the recorded digital 
stream. 

[01 1 1] To do this the DVD recorder detects the l-picture and time stamp information as described below for the map 

25 information of a TS2_VOB recording a stream encoded externally to the recorder. 

[01 1 2] First, l-pictures are detected by detecting the random access indication information (random_access_indicator) 
of the TS packet adaptation field shown in Fig. 12 or detecting the unit start indication information (payload_unit_start_ 
indicator) in the TS packet header. The time stamp is detected by detecting the PTS in the PES header. Note that the 
PCR from the adaptation field or the TS packet arrival time at the DVD recorder can be used instead of the PTS for the 

30 time stamp. In any case, the DVD recorder detects l-picture locations based on information in a high level system layer 
and does not need to analyze the data structure of the MPEG stream video layer. This is because the system overhead 
required to analyze the video layer in order to generate the map information is great. 

[01 1 3] There are also cases in which system layer detection is not possible. The map information cannot be generated 
in such cases and it is therefore necessary to indicate that there is no valid map information. The DVD recorder indicates 

35 this using the map management information shown in Fig. 15 (b). 

[0114] The map management information shown in Fig. 15 (b) contains map validity information and a self-encoding 
flag. The self-encoding flag indicates that an object was encoded by the DVD recorder, and thus indicates that the 
internal picture structure is known and that the map information time stamp information and l-picture location information 
is accurate. The map validity information indicates whether or not there is a valid access map. 

40 [01 1 5] Examples of when the system layer cannot be detected include when the adaptation field is not set and when 
the digital stream is not an MPEG transport stream. Various digital broadcasting standards and formats are used around 
the world, and there will naturally be cases in which the DVD recorder records objects for which it cannot generate a 
map. For example, if a DVD recorder designed for the Japanese market and recording digital broadcasts in Japan is 
used in the United States to record digital broadcasts in the United States, there will likely be cases in which the DVD 

45 recorder cannot generate a map for the recorded objects. 

[01 16] The DVD recorder can, however, sequentially reproduce from the beginning objects for which map information 
is not generated. In this case video from the recorded digital stream can be reproduced by outputting it through a digital 
interface to a STB appropriate to the stream. 

50 6. Basic operation of the playback function 

[01 1 7] The playback operation of a DVD recorder/player for reproducing content recorded to an optical disc as described 
above is described next below with reference to Fig. 18. 

[0118] As shown in Fig. 18 the DVD player has an optical pickup 201 for reading data from the optical disc 100, an 
55 ECC processor 202 for error correction processing of the read data, a track buffer 203 for temporarily storing the read 
data after error correction, a PS decoder 205 for reproducing video objects (PS_VOB) and other program streams, a 
TS decoder 206 for reproducing digital broadcast objects (TS2_VOB) and other transport streams, an audio decoder 
207 for reproducing audio objects (AOB), a still picture decoder 208 for decoding still picture objects (POB), a switching 
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means 210 for changing data input to the decoders 205 to 208, and a controller 21 1 for controlling the various parts of 
the player. 

[0119] Data recorded to the optical disc 100 is read by the optical pickup 201, passed through the ECC processor 
202 and stored to track buffer 203. Data stored to the track buffer 203 is then input to and decoded and output by the 

5 PS decoder 205, TS decoder 206, audio decoder 207, or still picture decoder 208. 

[0120] The controller 21 1 determines what data to read based on the playback sequence defined by the program 
chain information (PGC) shown in Figs. 16A and 16B. Using the example shown in Figs. 16A and 16B, the controller 
21 1 thus first reproduces part (CELL #1) of VOB #1, then part (CELL #2) of VOB #3, and finally VOB #2 (CELL #3). 
[0121] Using the cell information of the program chain information (PGC) shown in Fig. 17, the controller 21 1 can also 

10 capture the type of cell reproduced, corresponding objects, and the playback start and end times of the objects. The 
controller 21 1 inputs data for the object period identified from the cell information to the appropriate decoder. 
[0122] The controller 21 1 also identifies the objects to be reproduced based on the Object ID of the cell information. 
The controller 21 1 also identifies the cell, which is the playback period of the identified object, by converting the Starts 
PTM and End_PTM of the cell information to a disc address value by referencing the access map of the corresponding 

15 VOB information. 

[0123] A player according to this embodiment of the invention also has a digital interface 204 for supplying the AV 
stream to an external device. It is therefore possible to supply the AV stream to an external device through an IEEE 
1394, IEC 958, or other communications means. This is so that, for example, when the player does not have an internal 
decoder for decoding a TS2_VOB not encoded by the recorder/player the TS2_VOB can be output directly without 

20 decoding through the digital interface 204 to an external STB for decoding and presentation via the STB. 

[0124] When the digital data is directly output to an external device, the controller 21 1 determines whether random 
access playback is possible based on the map information shown in Fig. 15 (b). If the access point data flag (random 
access presentation flag) is valid, the access map contains l-picture location information. In this case the controller 21 1 
is able to access and output digital data containing an l-picture to an external device through the digital interface in 

25 response to fast play and other requests from the external device. Furthermore, time-base access is also possible if the 
time access information flag is valid. In this case the controller 211 can access and output digital data including the 
picture data at a specified playback time to an external device through the digital interface in response to a time-base 
access request from an external device. 

30 7, Basic operation of the recording function 

[0125] The configuration and operation of a DVD recorder according to the present invention for recording and repro- 
ducing an optical disc as described above is described next below with reference to Fig. 19. 

[0126] As shown in Fig. 19 the DVD recorder has a user interface 222 for receiving user requests and displaying 
35 information and prompts to the user, a system controller 212 handling the overall management and control of the DVD 
recorder, an analog broadcast tuner 213 for receiving VHF and UHF broadcasts, an encoder 214 for converting analog 
signals to digital signals and encoding the digital signals to an MPEG program stream, a digital broadcast tuner 21 5 for 
receiving digital satellite broadcasts, an analyzer 216 for interpreting the MPEG transport stream sent from a digital 
satellite, a presentation unit 21 7 such as a television and speakers, and a decoder 21 8 for decoding the AV stream. The 
<o decoder 21 8 has first and second decoders, for example, such as shown in Fig. 1 8. The DVD recorder also has a digital 
interface 21 9, track buffer 220 for temporarily storing write data, a drive 221 for writing data to the disc, and a converter 
223. The digital interface 219 is an IEEE 1394 or other communications interface for outputting data to an external 
device. The converter 223 converts the transport stream to a program stream according to the flow chart shown in Fig. 
37 and described further below. 
45 [0127] With a DVD recorder thus comprised the user interface 222 first receives a request from the user. The user 
interface 222 then passes the request to the system controller 212, and the system controller 212 interprets the user 
request and instructs the various modules to run appropriate processes. 

[01 28] Recording includes self -encoding in which the DVD recorder encodes the input digital data, and outside encoding 
for recording already encoded digital data to disc without further encoding. 

50 

7.1 Recording by self-encoding 

[0129] Recording with self-encoding is described first below using by way of example encoding and recording an 
analog broadcast to a PS_VOB stream. 
55 [01 30] The system controller 21 2 sends a receive command to the analog broadcast tuner 21 3 and an encode command 
to the encoder 214. 

[0131] The encoder 214 then video-encodes, audio-encodes, and system-encodes the AV data from the analog 
broadcast tuner 213, and passes the encoded data to the track buffer 220. 
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[0132] Immediately after encoding starts, the encoder 214 sends the time stamp data at the beginning of the MPEG 
program stream being encoded to the system controller 212 as the playback start time (PS_VOB_V_S_PTM), and 
parallel to the encoding process sends the data required to create the access map to the system controller 212. This 
value is set as the Start_PTM of the cell information shown in Fig. 1 7 and generated later. The time stamp information 

5 is generally the PTS, but the SCR can be used instead. 

[0133] The system controller 212 then sends a record command to the drive 221 , and the drive 221 thus extracts and 
records data accumulated in the track buffer 220 to the DVD-RAM disc 1 00. A contiguous data area (CDA) as described 
above is also found in the recordable area of the disc and the data is recorded to the located contiguous data area. 
[01 34] Recording typically ends when the user inputs a stop recording command. Stop recording commands from the 

10 user are input through the user interface 222 to the system controller 212, and the system controller 212 then sends a 
stop command to the analog broadcast tuner 213 and encoder 214. 

[01 35] The encoder 21 4 stops encoding when it receives the stop encoding command from the system controller 21 2, 
and sends the time stamp data of the last data in the last encoded MPEG program stream to the system controller 21 2 
as the playback end time (PSJ/OB J/_E_PTM). This value is set as the End_PTM of the cell information shown in Fig. 
is 1 7. The PTS is normally used for the time stamp information but the SCR can be used instead. 

[0136] After ending the encoding process the system controller 212 generates the playback control information and 
VOB information (PSJ/OBI) for the PS_VOB shown in Fig. 15. 

[0137] The VOB information generated here includes map management information and an access map appropriate 
to the object type. The system controller 21 2 sets the map validity information of the map management information to 

20 "valid," and sets the self-encoding flag ON. 

[0138] Original playback information (0_PGC information, see Figs. 16A and 16B) in which the object to be recorded 
is one of the playback objects is generated as the playback control information. This 0_PGC information is added to 
the original program chain infromation table. The original program chain inf romation (0_PGC information) contains cell 
information. The cell information Type is set to PS_VOB. 

25 [0139] The system controller 212 then instructs the drive 221 to stop recording data accumulated in the track buffer 
220 and to record the VOB information (PS_VOBI) for PS_VOB and playback control information. The drive 221 thus 
records this information and the remaining data in the track buffer 220 to the optical disc 1 00, and the recording process 
ends. 

[0140] It will be obvious that an analog broadcast could be encoded to TS1_VOB. In this case the encoder 214 must 
30 be an encoder for converting the analog signal to a digital signal and encoding the digital signal to the MPEG transport 
stream, and the type information in the cell information is set to TS1_VOB. The PTS or PCR can be used for the Start. 
PTM and End.PTM. 

7.2 Recording by outside encoding 

35 

[0141] Recording with outside encoding is described next below with reference to recording a digital broadcast. The 
recorded object type in this case is TS2_VOB. 

[0142] A digital broadcast recording request from the user is passed from the user interface 222 to the system controller 
21 2. The system controller 21 2 then instructs the digital broadcast tuner 215 to receive and instructs the analyzer 21 6 
40 to analyze the received data. 

[01 43] An M PEG transport stream sent from the digital broadcast tuner 21 5 is passed through the analyzer 21 6 to the 
track buffer 220. 

[0144] To generate the VOB information (TS2_VOBI) of the encoded MPEG transport stream (TS2_VOB) received 
as a digital broadcast, the analyzer 21 6 first extracts the time stamp data at the beginning of the transport stream as the 
45 start time information (TS2_VOB_V_S_PTM) and sends it to the system controller 212. This start time value is set as 
the Start_PTM of the cell information shown in Fig. 17 and generated later. The time stamp information is the PCR or 
PTS. The timing at which the object is sent to the DVD recorder could alternatively be used. 

[0145] The analyzer 216 then analyzes the system layer of the MPEG transport stream to detect the information 
needed for access map generation. The l-picture locations in the object are detected based on the random access 
so indicator (random_access_indicator) in the adaptation field of the TS packet header as described above, or the unit start 
indication information (payload_unit_start_indicator) in the TS packet header. 

[0146] The system controller 212 then outputs a record command to the drive 221, and the drive 221 thus extracts 
and records data accumulated in the track buffer 220 to the DVD-RAM disc 1 00. The system controller 21 2 also instructs 
the drive 221 where to record on the disc based on the file system allocation data. A contiguous data area (CDA) as 
55 described above is also found in the recordable area of the disc and the data is recorded to the located contiguous data 
area. 

[01 47] Recording typically ends when the user inputs a stop recording command. Stop recording commands from the 
user are input through the user interface 222 to the system controller 212, and the system controller 212 then sends a 
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stop command to the digital broadcast tuner 215 and analyzer 21 6. 

[0148] In response to the received stop command from the system controller 212, the analyzer 216 stops analyzing 
the received data and sends the time stamp data at the end of the last analyzed MPEG-TS to the system controller 21 2 
as the playback end time (TS2_VOB_V_E_PTM). This value is set as the End_PTM of the cell information shown in 
5 Fig. 17. The PCR or PTS is used for the time stamp information but the timing when the object was sent to the DVD 
recorder can be used instead. 

[01 49] After ending the digital broadcast reception process, the system controller 21 2 generates the playback control 
information and VOB information (TS2_VOBI) for the TS2_VOB as shown in Fig. 15 based on the information received 
from the analyzer 21 6. 

w [0150] The VOB information generated here includes map management information and an access map appropriate 
to the object type. The system controller 212 sets the map validity information of the map management information to 
"valid" when the l-picture locations in the objects were detected and the access map could be generated. The self- 
encoding flag is set OFF. When a valid access map could not be generated the map validity information is set to an 
"invalid" state. Examples of when a valid access map cannot be generated include when a corresponding digital broadcast 

15 is not received and when there is no random access information set in the adaptation field. When the signal is input 
directly through the digital interface the signal may also not be an MPEG transport stream, and in this case, too, the 
map validity flag is set to "invalid." 

[0151] Original playback information (0_PGC information) as shown in Figs. 16A and 16B for the recorded object as 
one of the playback objects is generated as the playback control information. This 0_PGC information is added to the 
20 original program chain information table. The original program chain information (0_PGC information) contains cell 
information of which type information is set to "TS2_VOB". 

[0152] The system controller 212 then instructs the drive 221 to stop recording data accumulated in the track buffer 
220 and to record the VOB information (TS2_VOBI) forTS2_VOB and playback control information. The drive 221 thus 
records this information and the remaining data in the track buffer 220 to the optical disc 100, and the recording process 
25 ends. 

[01 53] While the above recording operations are described with reference to recording start and end commands input 
by the user, it will be obvious that the same essential operation applies to timer recordings such as used in a VCR, for 
example. In this case the system controller automatically issues the recording start and end commands instead of the 
user, and there is no essential change in DVD recorder operation. 

30 

8. Main concept of the invention 

[01 54] A data recording medium according to the present invention is a medium for recording data of various different 
formats, including analog broadcast or digital broadcast content and various types of data input through an analog/digital 
35 interface. A data recording apparatus according to the present invention is an apparatus for recording AV data to the 
same data recording medium, 

[0155] More particularly, externally input AV data is recorded as an MPEG-TS, and a stream adding decoder input 
time information (time stamp information) for each MPEG-TS packet to each MPEG-TS packet is recorded to the data 
recording medium of the present invention. The time stamp information added to the MPEG-TS packets and the time 

40 stamp information added to the MPEG-PS packs after conversion are correlated based on a specific relation. 

[01 56] Fig. 20 shows an MPEG transport stream (MPEG-TS) and the conversion of an M PEG-TS to an M PEG program 
stream. As shown in this figure the MPEG-TS contains a PS I (Program Specific Information) packet containing MPEG- 
TS control information, places recorder-specific and content-specific information in a private use stream (Tip packet), 
and records the decoder input time (ATS) for each packet in a format appropriate to accumulation. 
[0157] For easier conversion from a multiplexed MPEG-TS to an MPEG-PS, a specific number (one or more) of MPEG- 
TS packets are system-encoded to one continuous unit for multiplexing (multiplexing unit) for recording as an MPEG- 
TS. One multiplexing unit is determined so that data amount of one multiplexing unit corresponds to that of one MPEG- 
PS data pack. Introducing this multiplexing unit concept makes it simple to convert from an MPEG-TS to MPEG-PS by 
simply converting MPEG-TS packets in multiplexing units to MPEG-PS video packs or audio packs, and an MPEG-TS 

50 can therefore be easily converted to an MPEG-PS. 

9. Detailed description of the embodiments 
9.1 Encoder configuration 

55 

[0158] The encoder of a data recording apparatus according to the present invention is described next below using 
by way of example self -encoding AV input to an MPEG-TS. 

[0159] The configuration of an encoder in a data recording apparatus according to the present invention is shown in 
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Fig. 21. The encoder receives video, audio, and vertical blanking interval (VBI) signals to encode them to a transport 
stream. 

[01 60] Encoder operating modes include a DVD-Video compatibility mode, DVD Video Recording compatibility mode, 
and a normal mode. The encoder generates an MPEG-TS that can be easily converted by the method further described 

5 below to the DVD-Video standard when in the DVD-Video compatibility mode, generates an MPEG-TS that can be easily 
converted by the method described below to the DVD Video Recording ("DVD VR" below) standard when in the DVD 
Video Recording compatibility mode, and generates an MPEG-TS having specific attributes when in the normal mode. 
When recording in the normal mode it is acceptable to use audio coding methods other than those defined by the DVD 
standards, and tolerance values in the video coding method (such as the GOP length) may be outside the range of 

w values defined by the DVD standards. 

9.2 Self-encoded MPEG-TS 

[0161] A preferred embodiment of an MPEG-TS format self-encoded by a data recording apparatus according to the 
w present invention is described below, specifically describing the differences between a normal MPEG-TS (below labelled 
"SESF") and an MPEG-TS that can be easily converted to an MPEG-PS (below called a "Constrained SESF"). 
[0162] In the example described below, each MPEG-TS stream stores information describing encoding conditions for 
the stream in a VOBI storing attribute information. By thus storing information describing encoding conditions in the 
management information, that is, outside the stream, it can be quickly determined whether or not the stream can be 
20 easily converted to a DVD-Video or DVD VR format without analyzing the stream. This information about the stream 
encoding conditions can be stored to the Tip packet (described below). 

[0163] These stream encoding conditions are stored in a 2-bit "encode_condition" flag. The values of this flag are 
defined below. 

00b: normal MPEG-TS (SESF) 
25 01 b: MPEG-TS that can be easily converted to a stream in DVD VR format (Constrained SESF) 
10b: reserved 

1 1 b: MPEG-TS that can be easily converted to a stream in DVD-Video format (Constrained SESF) 
[0164] Whether a stream can be easily converted to a DVD-Video or DVD VR stream format can thus be easily 
determined by reading the encode_condition field of the VOBI set as described above. It will be noted that "easily 
30 converted" as used herein means convertible by the method described below. 

9.3 Constrained SESF stream format 

[01 65] The stream format of a Constrained SESF transport stream is shown in Fig. 55. A Constrained SESF contains 
35 plural SESF capsules. A SESF capsule starts with a Tip packet (described in detail below) and contains a specified 
number of multiplexing units. The presentation time stamp (PTS) of each SESF capsule and the Tip packet address 
information is correlated by an address map. As will become clear below, in TS2PS conversion, a conversion process 
is performed for each SESF capsule. 

[0166] Fig. 20 shows the correlation between each packet in one SESF capsule and MPEG-PS packs. As shown in 
40 Rg. 20 a TS packet (referred to as a Tip packet below) storing specific information about the stream is inserted to a 
Constrained SESF. The Tip packets embedded in a Constrained SESF are described below with reference to Fig. 23 
to Figs. 29A and 29B. 

<Tip packet> 

45 

[01 67] Fig. 23 shows the complete structure of aTip packet. As shown here each Tip packet stores a Data JD identifying 
the packet as a Tip packet, display_and_copyJnfo corresponding to DCLCCI field of DVD VR and containing display 
control and copy control information, encodejnfo storing stream encoding information, and Makers Private Data storing 
data specific to the manufacturer and added by the manufacturer. 
50 [0168] As shown in Fig. 23 and Fig. 24, the PCR value needed for the SCR calculation further described below is 
written to the adaptation field of the Tip packet. This adaptation field is a fixed byte length and thus enables accessing 
various information in the Tip packet using a fixed address. 

[0169] Fig. 25 shows the DataJD field structure. The Data J D field contains a Datajdentrfier for identifying the packet 
as a Tip packet. This Datajdentifier is a 3-byte field storing value "0x544950" denoting TIP" in ASCI I code. The decoder 
55 of the playback drive can identify Tip packets by reading the value of this field. 

[0170] Fig. 26 shows the structure of the display_and_copy_info field. Generating the RDI packs when converting the 
Constrained SESF to the DVD VR format is facilitated by writing the same structure and information as the DCLCCI 
field of the RDI Unit in the DVD VR standard to this display_and_copy_info field. (It is noted that details about the DCI_ 
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CCI field of the DVD VR standard can be found in "DVD Specifications for Rewritable/Re recordable Disc, Part 3, Video 
Recording," and in Japanese Patent No. 31 62044. While some of the field names may be different in these documents, 
the field definitions are the same so as to enable direct conversion to the DVD VR format.) 

[01 71 ] Fig. 27 shows the structure of the encodejnf o field. The video_resolution field stores the resolution of the video 
5 stream following the Tip packet The values of this encode_info field are defined below. 

0000b: 720x480(NTSC), 720x576(PAL) 

0001 b: 704x480(NTSC), 704x576(PAL) 

0010b: 352x480(NTSC), 352x576(PAL) 

0011 b: 352x240(NTSC), 352x288(PAL) 
10 01 00b: 544x480(NTSC), 544x576(PAL) 

0101b: 480x480(NTSC), 480x576(PAL) 

Others: reserved 

[01 72] The DVD VR format allows the resolution to change during a single continuous recording. Streams of different 
resolutions are managed in separate VOBs, and seamless stream connections are assured during playback by the 
15 recorder. When the resolution changes during Constrained SESF recording, this video_resolution field is used to identify 
the point from which the VOB must change when converted to the DVD VR format. 

[0173] In a Constrained SESF recorded to facilitate conversion to the DVD-Video format (encode_condition = 1 1 b) a 
change in resolution within a stream is not permitted. 

[0174] The encode_condition field stores the same information stored in a VOBI. The reason why this information is 
20 stored not only in the stream management information but also embedded in the stream is that even if the stream is 

copied through a digital interface such as IEEE 1394 the recorder receiving the stream can easily determine if the stream 

can be easily converted to a DVD format by simply confirming the value of this encode_condition field in the Tip packet. 

The recorder can determine the encoding conditions of at least the elementary streams between one Tip packet and 

the next Tip packet (i.e., the SESF capsule) in the received stream. 
25 [0175] VOBU_S_PTM from the DVD VR standard is recorded to the FVFPST field. This is to eliminate the need to 

analyze the encoded video stream following the Tip packet to calculate the playback time of the first presented video 

field when converting a Constrained SESF to a DVD-Video or DVD VR format. 

[0176] The FVFPST field contains a 32-bit field denoting the video field presentation time with 90 kHz precision, and 
a 16-bit field denoting the presentation time with 27 MHz precision, which cannot be expressed in the 32-bit field. 
30 [0177] Fig. 28 shows the MakersPrivateData. As shown in Fig. 28 the MakersPrivateData contains a makerJD field 
identifying the manufacturer that generated the Constrained SESF, and a maker_private_data field containing other 
specific information added by the manufacturer. 

[01 78] Fig. 29A and 29B show exemplary stream_type values denoting the Tip packet PID and stream type. Because 
both PID and stream_type values are reserved by the MPEG and other standards, the values used are selected so as 
35 to not interfere with these reserved values and to denote private data outside the MPEG standard. 

[0179] It will thus be apparent that various stream attributes are extracted and stored to the Tip packet stored in a 
Constrained SESF. How the fields described above are used during conversion to a DVD format is described in further 
detail below. 

*o 9.4 System-encoding conditions 

[0180] System-encoding conditions for Constrained SESF are described next in detail. It should be noted that the 
following system-encoding conditions are not applicable to an SESF for which the encode_condition field is neither set 
to "01 b" or "1 1 b", that is, an SESF that is not a Constrained SESF. 

45 

Multiplexing unit> 

[01 81] Each TS packet storing the elementary streams of a Constrained SESF has a unit for multiplexing (multiplexing 
unit) of data stored in 2-KB packs according to a DVD format. 

so [0182] Why this concept of multiplexing units is introduced is described briefly with reference to Figs. 57A and 57B. 
Fig. 57A describes converting an MPEG-TS in an unconstrained format to an MPEG-PS. To convert an MPEG-TS to 
an MPEG-PS the multiplexing order of the TS packets (video packets and audio packets) that are the multiplexing units 
of the MPEG-TS must be changed so that each pack in the MPEG-PS contains only one type of data. This is because 
the TS packets (188 bytes) that are the MPEG-TS multiplexing unit are smaller than the packets (2 KB) that are the 

55 MPEG-PS multiplexing unit. More specifically, it is necessary to collect and stuff only video packets from the MPEG-TS 
to MPEG-PS video packs (V_PCK), and collect and stuff only audio packets from the MPEG-TS to MPEG-PS audio 
packs (A_PCK). As shown in Fig. 57 (a), the multiplexed sequence of audio packets (A packets) storing audio data in 
the MPEG-TS Is changed in the converted MPEG-PS and stored in audio pack A_PCK #1 at the end of the stream. 
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[0183] Fig. 57 (b) describes converting a constrained format MPEG-TS to an MPEG-PS. In this constrained format 
eleven consecutive TS packets are managed as one multiplexing unit. The total amount of data stored in one multiplexing 
unit is determined so as to not exceed the amount of data stored in one pack. It should be noted that the amount of data 
(or data size) referred to here does not include the pack or packet header information and means only the video or audio 
5 data. Furthermore, the eleven consecutive TS packets managed as one multiplexing unit all store the same type of data, 
video or audio. 

[0184] It will thus be obvious that by introducing multiplexing units as described above it is not necessary to change 
the multiplexing sequence of the TS packets that are the MPEG-TS multiplexing units when converting a constrained 
format MPEG-TS to an MPEG-PS. 

to [0185] As shown in Fig. 20, all TS packets stored in one multiplexing unit store only one type of elementary stream 
and TS packets storing different types of elementary streams are not stored in a single multiplexing unit. It will also be 
noted that it may be necessary for a multiplexing unit (such as the multiplexing unit storing the last part of the stream) 
to also store a null packet, and mixing a null packet into a multiplexing unit is therefore not prohibited. Including a null 
packet is also necessary to clarify the relationship between multiplexing unit and packs. 

15 [0186] One multiplexing unit thus contains 1 1 consecutive TS packets, and the elementary stream (payload data) in 
each multiplexing unit is completely stored to the one corresponding pack. This similarly constrains the pack relationship. 
[0187] The TS packet storing the PES packet header is the first TS packet in a multiplexing unit. This correlates the 
packet header of the pack (called the PES packet header in the MPEG-TS) to the PES packet header in the Constrained 
SESF, and enables the consecutive TS packets to be easily converted in sequence. 

20 [0188] When the PES packets storing the video stream are divided between multiple multiplexing units, all multiplexing 
units other than the multiplexing unit containing the last byte of the PES packet store 2024 bytes (= 184 x 11) of TS 
packet payload data. This enables the most efficient stream transfers and makes sequential processing by TS packet 
unit easier during TS2PS conversions. If multiplexing units other than the last multiplexing unit are allowed to contain 
less than 2024 bytes, it will not be possible to determine the value of the PES_packet_1ength field stored to the packet 

25 header of each MPEG-PS pack on-the-fly when converting the first TS packet in a multiplexing unit during TS2PS 
conversion. 

[0189] A PES packet storing an audio stream starts at the first TS packet in one multiplexing unit and ends within that 
multiplexing unit. This is easy to understand if storing a PES packet storing an audio stream to multiple multiplexing 
units is considered. If one audio PES packet is divided between multiple multiplexing units, the internal structure of the 
30 audio stream will need to be analyzed when converting the second and subsequent multiplexing units to MPEG-PS 
packs because the PTS must be determined or the number of audio frames in one pack must be determined in order 
to generate the packet header. 

[01 90] A multiplexing unit is thus defined as described above. An encoderthat generates a Constrained SESF performs 
the system-encoding with constraints of the multiplexing unit described above. 

35 

9.5 Constraints on the PES packet header in a Constrained SESF 

[0191] Some constraints on the field values of the PES packet header in a Constrained SESF are described next. 
[0192] As shown in Fig. 30 some PES packet header fields allow only fixed values. This is to prevent the need for 
40 unnecessary processing during conversion to a DVD format. "Unnecessary processing" as used here means processing 
fields that are added or deleted by values differing from values defined by the DVD format. In other words, the object of 
these constraints on the PES packet header is to minimize the fields that are added to or deleted from the header during 
TS2PS conversion. 

[0193] It should be noted that the PES_packet_length field can be set to 0 in a PES packet storing MPEG-TS video. 
45 The value stored to the PES_packetJength field must therefore be calculated during TS2PS conversion from the packet 
header length stored to the pack and the byte length of the payload data. 

[0194] The PTS_DTS_flags field denotes whether the PTS or DTS is defined. The PTS_DTS_flags field value in the 
Constrained SESF is set according to the following rules. 

[0195] If the PES packet stores a video stream, PTS_DTS_flags is set to 1 1 b under the following conditions: 

so 

1) A frame-encoded l-picture is stored to the PES packet; 

2) A frame-encoded P-picture is stored to the PES packet; 

3) A pair of field-encoded l-pictures are stored to the PES packet; 

4) A pair of field-encoded P-pictures are stored to the PES packet; or 

55 5) A field-encoded l-picture is followed by a field-encoded P-picture In the PES packet. 

[0196] If the PES packet stores an audio stream, one or more audio frames always starts in the PES packet and PTS_ 
DTSJlags is set to 1 0b (11 b if the DTS is defined). 
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[0197] Constraints are also applied to the PES_extension_flag and PES_header_data_length fields to enable sequen- 
tial processing by TS packet unit during TS2PS conversion. These constraints are shown in Fig. 31 . 
[01 98] As shown in Fig. 31 the field values are defined according to the type of elementary stream, PES packet location, 
and encode_condition value. 

[0199] V1 in Fig. 31 is the sum of the byte length of the PTS field and DTS field in the PES packet. That is, 
if PTS_DTS Jlags = 00b, V1 = 0; 
if PTS_DTS_flags = 10b, V1 = 4; 
if PTS_DTS_flags = 11 b, V1 = 10. 

[0200] This constraint is necessary to enable sequential processing by TS packet when converting to DVD-Video or 
DVD VR, instead of compiling the packs after determining the payload length of each pack, as noted above, 
[0201] The PES packet header is thus defined as described above. An encoder that generates a Constrained SESF 
performs system-encoding with the constraints described above. 

9.6 Constraints on the Tip packet insertion interval 

[0202] Constraints relating to the insertion interval of Tip packets inserted to a Constrained SESF are described next. 
[0203] The decoder input time denoted by the Tip packet ATS (ATS1) and the decoder input time indicated by the 
ATS (ATS2) of the Tip packet storing the video or audio stream first input to the decoder after the Tip packet must be 
related as follows. 



ATS1 +T <= ATS2 

where T is the minimum transfer time of a PS pack, This minimum transfer time T is the shortest time from the start to 
the end of PS pack input to the decoder. In other words, the above equation shows that the ATS interval of each TS 
packet must be greater than the interval enabling at least the converted PS pack to be input to the system decoder. T 
is obtained from the next formula. 

T = (PS jDack_size*8*system_clock_frequency) / PSrate 

PS_pack_size is the byte length of one MPEG-PS pack generated by TS2PS conversion, system_clock_frequency is 
the frequency of the MPEG-PS decoder reference clock, and PSrate is the multiplex rate of the MPEG-PS generated 
by TS2PS conversion. 

[0204] PS_pack_size, system_clockJrequency, and PSrate are further defined as follows by the DVD format: 

PS_pack_size = 2048 bytes 

system_clock_frequency = 27,000,000 Hz 

PSrate = 10,080,000 bits/second. 

The relationship between ATS1 and ATS2 is therefore: 

ATS1 +43885.714... <= ATS2 

and ATS1 + 438B6 = ATS2 is therefore the minimum value of ATS2. 

[0205] More specifically, the TS2PS conversion described below converts a Tip packet to a 2 KB NV_PCK (when 
converted to DVD-Video) or RDI_PCK (when converted to DVD VR), and if the above equation is not satisfied transfer 
of the next elementary stream starts sooner and could exceed the DVD system transfer rate of 10.08 Mbps. 
[0206] It should be noted that the same effect can be achieved by assuring the above-described interval between AV 
data transfers before and after each Tip packet, and the invention shall not be limited to inserting a period in which AV 
data is not transferred only after Tip packet transfer. 

[0207] An integer number of GOPs are aligned between two consecutive Tip packets (that is, in one SESF capsule). 
This is so that data from one Tip packet to the TS packet immediately before the next Tip packet (that is, SESF capsule) 
corresponds to VOBU in DVD format and the concept of a DVD format VOBU is thus also achieved in a Constrained 
SESF. VOBU in DVD format (such as DVD VR) must contain an Integer number of GOPs. 

[0208] The time on the playback time base from one Tip packet to the next Tip packet must be 0.4 second or longer 
and 1 .0 second or less. The playback time for the playback data following the last Tip packet must be 0.4 sec or longer 
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and 1 .2 second or less if encode_condition = 1 1 b (DVD-Video or DVD VR mode), and 1 .0 second or less if encode_ 
condition = 01 b (DVD VR mode). This is because a Tip packet means the start of a VOBU and to conform to each DVD 
format. 

[0209] The access map fortime-address conversion points uniquely (1 :1 ) to each Tip packet. This is so that conversion 

5 can start immediately by VOBU unit in DVD format during TS2PS conversion. 

[0210] It should be noted that it is not necessary for the access map to point to every Tip packet. For example, the 
AV data following the last Tip packet in a Constrained SESF is handled differently from other Tip packets since it is 
different from other Tip packets, for example, different playback time, lack of following by a next Tip packet and so on, 
No problem with playback or conversion is thus created by not registering the last Tip packet in the access map, and 

10 can therefore be handled as an exception with consideration for the hardware configuration of the recorder. It is also 
possible that the access map does not point to each Tip packet because of such external factors as limitations on the 
size of the access map. 

[021 1] Constraints on the Tip packet insertion interval are thus defined as described above. An encoder that generates 
a Constrained SESF performs the system-encoding within the constraints described above. 

15 

9.7 Constraints relating to decoder control 

[021 2] Constraints relating to Constrained SESF decoder control, specifically buffer management, are described next 
below. 

20 [0213] A Constrained SESF must be generated to satisfy the standards defined by the standard decoder model T_ 
STD in MPEG-TS, This enables an STB, for example, having a T_STD-conforming decoder to decode the Constrained 
SESF insofar as the stream type is compatible. 

[0214] The MPEG-TS standard decoder model T_STD and the MPEG-PS standard decoder model P_STD are sub- 
stantially identical in operation and processing capacity, but differ in the input rate of the audio stream to the decoder. 
25 More specifically, referring to Fig. 18, except for AAC (Advanced Audio Coding), the transfer rate to the audio buffer 
from the transfer buffer before the audio decoder is a fixed 2 Mbps in the T_STD. The P_STD, however, can input each 
stream to the decoder at the system rate, which with DVD is 1 0.08 Mbps. 

[0215] This means the same buffer management cannot be used for a Constrained SESF and DVD format. 
[0216] While the same buffer management thus generally cannot be used for a Constrained SESF and DVD format, 
30 extremely fast, simple conversion can be achieved when converting a Constrained SESF to DVD format without repeating 
the system encoding process if the SCR (System Clock Reference) denoting the decoder input start time of the converted 
packs can be calculated using the ATS assigned to each TS packet. Calculation of the SCR using the ATS is described 
in further detail below. 

[021 7] A Constrained SESF according to the present invention must be pre-encoded so that it conforms to the T_STD 
35 and so that the MPEG-PS generated by the below-described conversion process conforms to the P_STD. 

[0218] In other words, a Constrained SESF is a stream encoded to an MPEG-TS so that it also conforms to the P_ 
STD when converted by the process described below to an MPEG-PS. 

[021 9] The constraints relating to buffer management for a Constrained SESF are described above. It should be noted 
that an SESF is simply encoded to conform to the T_STD without being aware of the above constraints. 
*o [0220] Examples of an MPEG-TS and MPEG-PS not conforming to the T_STD and P_STD models are described below. 
[0221] An MPEG-TS self-encoded to enable conversion to an MPEG-PS but not conforming to the T_STD model is 
described first with reference to Fig. 32. 

[0222] Stream TS1 is an MPEG transport stream that is system-encoded according to the T_STD model. Stream TS2 
is an MPEG transport stream that does not conform to the T_STD model. 

45 [0223] More specifically, the values of ATS[47] to ATS[57] in stream TS2 are set to exceed the allowable audio data 
transfer rate of an MPEG-TS. This causes the audio transport buffer (see Fig. 18) to overflow, thus not meeting the 
requirements of the T_STD model. The values of ATS[47] to ATS[57] in stream TS1 , however, are set to an audio data 
transfer rate allowed in the MPEG-TS. This stream can thus be correctly converted to an MPEG program stream PS1 
conforming to the P_STD model by the SCR conversion formula described below. Stream TS2 also does not conform 

50 to T_STD but can be converted to PS1 using the SCR conversion formula described below. In order to convert stream 
TS2 to MPEG-TS conforming to T_STD , the transfer time interval of the audio packets specified by ATS[47] to ATS[57] 
must be spread out so that a transport buffer overflow does not occur. 

[0224] An example where the MPEG-TS conforms to the T_STD model but the MPEG-PS converted from the MPEG- 
TS does not conform to the P_STD model is described next with reference to Figs. 33A and 33B. Stream TS3 is an 
55 MPEG transport stream and steam PS3 is an MPEG program stream converted from MPEG transport stream TS3. Fig. 
33 (b) shows the change in the video data buffer state when decoding each. The PES #1 picture decoding time is SCR 
[2], and PES #2 picture decoding time is between SCR [4] and SCR [5]. 

[0225] As shown in Fig. 33B, In transport stream TS3, data transfer of PES #1 and PES #2 completes by a start of 
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decoding picture data in PES #1 and PES #2. In program stream PS3, however, V_PCK #1 data is transferred successfully 
to the decoder for PES #1 , but when decoding PES #2 V_PCK #4 data is not transferred in time and a buffer underflow 
occurs because decoding began before data transfer was completed. The program stream therefore does not conform 
to the P_STD model. To avoid this and assure that PES #2 transfer is completed In time, the ATS (ATS [14], ATS [25], 
5 ATS [36]) of each TS packet in the MPEG-TS converted to V_PCK #2 to V_PCK #4 can be shifted to a time before PES 
#2 picture data is decoded. 

[0226] Because buffer management preventing buffer underflow and overflow states is thus necessary for both the 
encoded MPEG-TS and MPEG-PS converted therefrom, it is necessary when encoding the MPEG-TS to anticipate both 
the encoded MPEG-TS and the MPEG-PS converted therefrom. 

w [0227] Figs. 58A and 58B describe buffer management for the MPEG-TS and anticipated MPEG-PS when the MPEG- 
TS before conversion and the MPEG-PS after conversion have the same bit rate. Buffer management of the anticipated 
MPEG-PS can be assumed in the present embodiment to be the same as buffer management for the encoded MPEG- 
TS. This is because the time stamp information (calculated_PCR) set in the multiplexing units of the MPEG-TS converted 
to an MPEG-PS and the time stamp information set in the MPEG-PS packs after conversion are the same. 

is [0228] Fig. 58A shows an example in which a buffer underflow occurs. Data transfer of the encoded MPEG-TS is not 
completed by the target time K1 , that is, the DTS timing. It is therefore anticipated that data transfer of the converted 
MPEG-PS will also not be completed in time. 

[0229] To avoid this buffer underflow state the MPEG-TS time stamp information must be set so that data transfer is 
completed by time K1 , the DTS timing, as shown in Fig. 58B. It can thus be anticipated that a buffer underflow will also 

20 not occur with the converted MPEG-PS. 

[0230] Figs. 59A and 59B describe buffer management for the MPEG-TS and anticipated MPEG-PS when the bit rate 
of the MPEG-TS being converted is higher than the resulting MPEG-PS. Buffer management of the anticipated MPEG- 
PS cannot be assumed to be the same as buffer management of the encoded MPEG-TS in this case. Separate buffer 
management is therefore required for the MPEG-PS. 

25 [0231] A buffer underflow occurs only with the MPEG-PS in the case shown in Fig. 59A. Data transfer is completed 
by the target time K1 (DTS timing) with the encoded MPEG-TS, and a buffer underflow does not occur. Data transfer is 
not completed by target time K1 (DTS timing) with the converted MPEG-PS, however, and a buffer underflow occurs. 
Therefore, data transfer must also end by time K1 , the DTS timing, in order to avoid a buffer underflow with the MPEG- 
PS. If the resulting MPEG-PS is an MPEG-PS used by the DVD standard, the system transfer rate cannot be increased. 

30 it is therefore necessary to reduce the image rate as shown in Fig. 59B or otherwise reduce the total amount of transfer 
data in order to ensure that the data can be completely transferred to the decoder in time. 

<ATS-SCR conversion> 

35 [0232] A method for determining the SCR of the PS packs when a Constrained SESF stream is converted to a program 
stream is described next. It should be noted that because the SCR is calculated when new packs are generated, the 
SCR must be calculated only when converting the first TS packet in the multiplexing unit. 

[0233] The basic concept for determining the SCR is described first. The time stamp information (SCR) set to the 
packs of the MPEG-PS after conversion is shown in Figs. 60A and 60B for two different cases. 
40 [0234] Fig. 60A shows a case in which the bit rate is the same in the MPEG-TS and MPEG-PS. In this case the same 
value as the time stamp information (calculated_PCR) set in the corresponding multiplexing unit of the MPEG-TS is set 
to the time stamp information (SCR) of the MP EG -PS packs. 

[0235] Fig. 60B shows a case in which the MPEG-TS transfer rate is higher than the MPEG-PS transfer rate. In this 
case the buffer input completion time (SCR[i-1]+T) of the immediately preceding pack is set to the SCRft] of each pack 

45 (V_PCK) in the program stream after conversion. Why the SCR is thus set is described below. 

[0236] If the calculated_PCR[i] of the multiplexing unit corresponding to SCR[i] is set as in the case shown in Fig. 60A, 
a time earlier than the buffer input completion time (SCR[i-1]+T) of the immediately preceding pack will be set to the 
SCR. If the SCR is set to this timing the stream will be unplayable by current DVD recorders, and this must be avoided. 
It should be noted that the MPEG-TS bit rate is set higher than in the MPEG-PS because the maximum transfer rate is 

50 slower for audio than for video. 

[0237] Determining the SCR is described in further detail below. 

[0238] In Constrained SESF stream, as shown in Rg. 55 a SESF capsule includes a Tip packet and a predetermined 
number of TS packets composing Multiplexing Unit. Si nee transferred in sync with a decoder reference time STC (System 
Time Clock), the stream includes PCR packet to reset STC. 
55 [0239] As shown in Rg. 1 4 each TS packet is added with a first time stamp information (ATS) indicating a transferred 
time to the decoder. A reference time of this first time stamp information (ATS) is different from the decoder reference time. 
[0240] Thus, Tip packet contains second time stamp information (PCR Jip) based on the decoder reference time as 
well as the first time stamp information (ATSJip) based on the same reference time as the TS packet. By referring to 
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Tip packet, the decoder can calculate the second time stamp information (PCR) from the first time information (ATS) of 
each TS packet. 

[0241] As shown in Fig. 61 , the second time stamp information calculated from the first time stamp information (ATS 
p]) of each TS packet located at the head of Multiplexing unit becomes second time map information for each Multiplexing 
5 unit (referred to as "calculated PCRp]" below). 

[0242] For example, the PCR value (PCRp]) of a TS packet is obtained from the following equation using the PCR 
(PCRJip) and ATS value (ATS Jip of the first Tip packet in an SESF capsule and the ATS value ATSp] of the next TS 
packet if carry (column overflow) of the ATS value is not considered. 

10 

PCRp] = PCRJip + (ATSp] - ATS Jip) 

To determine calculated_PCR[1] which indicates the decoder input time of the first multiplexing unit in case of Fig. 61 , 
for example, the following formula can be used. 

75 

calculated_PCR[1] = PCR[2] 

= PCRJip + (ATS[2] - ATS Jip). 

20 

The calculated_PCR for each multiplexing unit is likewise calculated while considering ATS column overflow. 
[0243] Fig. 34 shows the relationship between the calculated_PCR and SCR when converting from a Constrained 
SESF to MPEG-PS, and shows the first part of the capsule shown in Fig. 55. The ATS assigned in ascending order from 
the stream start to the TS packets at the beginning of each multiplexing unit are denoted in Fig. 34 as ATS[k]. The same 
25 notation is used for the calculated J>CR and SCR. The PCR value calculated in the order of appearance for the first TS 
packet in each multiplexing unit is shown as the calculated_PCR [i] (where i = 1 , 2, ...). The SCR of the converted packs 
is likewise denoted SCR [i]. 

[0244] As described above the maximum video stream transfer rate allowed by the T_STD model is 1 5 Mbps (the 
transfer rate from the multiplex buffer to the video buffer can not exceed 15 Mbps for the MP® ML, and the audio stream 

30 input rate is limited to a rate lower than the video, (transfer rate from a transport buffer to an audio buffer does not exceed 
2 Mbps except for AAC). Thus unlike a multiplexing unit storing video data, a multiplexing unit storing audio data is 
therefore transferred at a low rate. Therefore, if the video data transfer rate is to be raised to near the 9.8 Mbps maximum 
transfer rate of the DVD format, video data TS packets must be transferred at a higher rate than the DVD transfer rate 
(1 0.08 Mbps) in order to assure sufficient transfer time for the audio data, which has a lower transfer rate and therefore 

35 takes longer. 

[0245] As will be known from Fig. 34, the transfer times of the Constrained SESF and DVD format differ. 
[0246] The following relation must be true between the decoder arrival time calculated_PCR of the first TS packet in 
a multiplexing unit and the SCR of the packs converted therefrom. 
SCR[1] = calculated_PCR[1] 
40 sCR[i] = max(SCR[i-1] + T, calculated_PCR[i]) (i= 2, 3,...) 
calculated J^Rp] = PCRJip + (ATSp] - ATS Jip + WA'BS) 
T = PS_pack_size*8*system_clock Jrequency / PS rate 

where PCRJip and ATS Jip are the PCR value of the Tip packet immediately before the multiplexing unit being converted 
and the ATS value of that Tip packet; WA indicates how many times an overflow occurred at the ATS between ATS Jip 
45 and the ATS (ATS[i]) assigned to the first TS packet in the i-th multiplexing unit. More specifically, the ATS value is 
expressed as a finite bit count, there is a limit to the number that can be expressed, and overflow can occur. How many 
times such an overflow occurs is expressed with WA. BS is the data size corresponding to one ATS overflow. Function 
max(a,b) is a function for selecting the greater of a and b. 

[0247] Furthermore, PS_pack_size in the SCRp] (i= 2, 3,...) relation is, as noted above, the byte length of one pack 
so in the MPEG-PS output by the TS2PS conversion process; system_c!ockJrequency is the frequency of the MPEG-PS 
decoder reference time; and PSrate is the multiplex rate of the MPEG- PS generated by the TS2PS conversion. More 
specifically, 

PS_pack_size = 2048 bytes 
system_clockJrequency = 27,000,000 Hz 
55 PSrate = 1 0,080,000 bits/second. 

[0248] There are two patterns for outputting packs after the first pack: outputting the packs after waiting a minimum 
transfer time determined by the transfer rate from the output time of the preceding pack, and outputting at the decoder 
input time of the first TS packet in a pack. The former method of leaving the minimum transfer time before pack output 
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is selected when the packs are output before converting the video data to the DVD format. For example, if the packs 
are output at a time before the video data is converted to the DVD format, the former method of transfer after waiting a 
minimum transfer time determined by the transfer rate from the output time of the preceding pack is selected. 
[0249] A program stream obtained by TS2PS conversion must conform to the P_STD model as described above, and 
5 the SCR is therefore limited to values within a certain range. Therefore, the ATS value assigned to each packet of the 
Constrained SESF must be set with reference to the above ATS-SCR relation. 

9.8 Constraints relating to the elementary stream 

10 [0250] Constraints relating to the elementary stream of the Constrained SESF are described next. 

[0251] Because re-encoding the elementary streams is a very demanding process, only MPEG-2 Video is allowed for 
video data and AC-3, MPEG-1 Audio, and LPCM are allowed for audio data. 

[0252] The Constrained SESF omits LPCM so that re-encoding of elementary stream is not necessary and buffer 
management can be easier. Streams allowed for a Constrained SESF are therefore limited to only MPEG-2 Video for 
15 video data and only AC-3 and MPEG-1 Audio for audio data. 

[0253] Elementary stream attributes are shown in Fig. 35 when encode_condition = "1 1 b". 

[0254] Because the attributes shown in the figure are set to maintain compatibility at the elementary stream level to 
DVD-Video or DVD VR, a Constrained SESF (encode_condition = 1 1 b) set to these attributes does not require re- 
encoding the elementary stream for conversion to DVD-Video or DVD VR formats, and high speed conversion is therefore 
20 possible. 

[0255] Fig. 36 shows elementary stream attributes when encode_condition = "01 b". 

[0256] Because the attributes shown in the figure are set to maintain compatibility to DVD VR at the elementary stream 
level, a Constrained SESF (encode_condition = 01 b) which is set to these attributes does not require elementary stream 
re-encoding for conversion to DVD VR, and high speed conversion is therefore possible. 
25 [0257] Notes 1 to 4 in Fig. 35 and Fig. 36 are described next. 

Note 1 : This attribute cannot change inside the same VOB. 

Note 2: This attribute can change in the TS packet storing the first elementary stream following the Tip packet. In 
other words, it can change only in the first video or audio TS packet in the SESF capsule. 
30 Note 3: sequence_end_code cannot be inserted between sequencejieaders where the horizontaLsize, verticaL 

size, and aspect_ratio_information are the same. 
Note 4: This attribute can change inside the same VOB. 

[0258] Constraints relating to the elementary streams of a Constrained SESF are defined above. 
35 [0259] It should be noted that by applying the encoding conditions defined above a Constrained SESF enabling fast, 
simple conversion to DVD format can be produced. 

[0260] Fig. 37 is a flow chart of the process for generating program stream packs from TS packets (multiplexing units) 
storing AV data. 

[0261] As shown in the figure, a TS packet of a Constrained SESF storing AV data is converted to a 2 KB MPEG-PS 
*o pack storing AV data using one multiplexing unit as the processing unit. This process is described step by step below. 
[0262] (Step S4200) One TS packet is read from the Constrained SESF stream conversion starting point. 
[0263] (Step S4201) Whether the read TS packet contains AV data and is the first TS packet in a multiplexing unit is 
determined. 

[0264] Whether AV data is contained is determined by referencing the PID value of the TS packet which is declared 

45 by the PMT to be storing AV data. 

[0265] If the preceding TS packet is a Tip packet, PSI/SI packet, or PGR packet, the TS packet thereafter that contains 
AV data is known to be the first TS packet in the multiplexing unit. Because the conversion starting point is assumed to 
be a Tip packet, the beginning of a multiplexing unit can be detected by sequentially reading the TS packets (in other 
words, the first TS packet that is just after the Tip packet and contains AV data is always the beginning of a multiplexing 

so unit). 

[0266] If it is determined that the TS packet is not the beginning of a multiplexing unit, or if conversion does not start 
from a Tip packet and the beginning of the multiplexing unit cannot be identified, control returns to step S4200 to read 
the nextTS packet. 

[0267] If the beginning of a multiplexing unit is found, control advances to the next step. 
55 [0268] It should be noted that while not shown in this flow chart, conversion using the above-described SCR conversion 
method cannot be applied even for a TS packet at the beginning of the multiplexing unit if the Tip packet was not 
previously located, but conversion can proceed using the PCR packet instead of a Tip packet. 
[0269] (Step S4202) Using the ATS assigned to the first TS packet in the multiplexing unit, the decoder input time 
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(SCR) of the MPEG-PS pack converted from that TS packet is calculated. This SCR is calculated as described above. 

Once the SCR is determined the pack header shown In Fig. 38 is complete. This is because other than the SCR only 

fixed values are allowed in the pack header. 

[0270] (Step S4203) The packet header is then generated. 
5 [0271] The packet header is generated based on the PES packet header of the Constrained SESF. The resulting 

packet header must be formatted with the field values shown in Fig. 39. This is because if the header length or other 

field values are not constant the conversion from a Constrained SESF will not be constant and buffer management will 

be affected. It should be noted that fields not shown here store constant values and are therefore omitted here. 

[0272] The field values of the PES packet header are defined in detail in the Constrained SESF to minimize the 
10 processing required for conversion from a PES packet header (MPEG-TS) to MPEG-PS packet header. 

[0273] If the size of one PES packet is large relative to the size of one pack, one PES packet is converted to multiple 

packs. In this case, the followings to the packet headers of the second and subsequent packs are modified: PTS_DTS_ 

flags in the first packet header generated from the PES packet Is set to 00b; PES_extensionJlag is set to 00b; the 

stuffing_byte length is adjusted; and PES_header_data_length is corrected. 
*5 [0274] The packet headers are thus generated by modifying partially the first packet header from the PES packet 

header, and modifying partially the second and later packet headers from the first packet header. 

[0275] (Step S4204) The payload part of the TS packet is then simply copied sequentially from the beginning of the 

payload in the PS pack. 

[0276] (Steps S4205 to S4207) These steps simply repeat until the multiplexing unit is completed (i.e., for the 1 1 TS 
20 packets). Because a null packet could be inserted, the PID (0x1 FFF) of the null packet is confirmed and the TS packet 
payload data copied. 

[0277] Only the TS packet storing the last data in one PES packet is preferably defined to have an adaptation field. 
As a result, all TS packets except for the TS packet storing the last data in the one PES packet store a fixed-length 
payload containing 184 bytes, and reading the payload data is thus easier. 
25 [0278] (Step S4208) The byte length of the resulting program stream pack is then calculated when copying to the end 
of the multiplexing unit payload data is completed. If the pack length is 2048 bytes the pack is completed. If the byte 
length is not 2048 bytes, control steps to step S4209. 

[0279] (Step S4209) If the pack is less than 2048 bytes padding packets are added to the end of the payload so that 
the pack length is 2048 bytes. 

30 [0280] Conversion from a multiplexing unit storing AV data is as described above. This process simply repeats only 
when a multiplexing unit is detected until processing the designated part for conversion of the Constrained SESF ends. 
[0281 ] The conversion process described above is described in further detail below for processing the different types 
of packs. 

35 <Conversion to video packs (V_PCK)> 

[0282] Figs. 40 A and 40B show the conversion from a Constrained SESF to MPEG-PS. As shown in 40A, one video 
PES packet is normally larger than 2 KB, and is therefore typically divided into multiple multiplexing units for multiplexing 
to the Constrained SESF. 

40 [0283] Except for the last multiplexing unit in one video PES packet, the Constrained SESF is defined so that as much 
PES packet data as possible is stuffed into each multiplexing unit. Except for the last multiplexing unit, therefore, all 
multiplexing units store 2024 bytes (= 184 x 1 1 bytes) of data. 

[0284] By thus defining the Constrained SESF, the PES_packetJength and stuffing_byte fields can be predefined 
during TS2PS conversion. 

45 [0285] The last multiplexing unit storing data for one video PES packet fills the remaining area with an adaptation field 
and null packets to form one complete multiplexing unit. 

[0286] As shown in Figs. 40A and 40B, the multiplexing units in one video PES packet include the following three types. 
[0287] The first multiplexing unit storing the first data in the PES packet (MU #1 in the figure), multiplexing units storing 
data from the middle of the PES packet (MU #n where n = 2, 3, N-1 in the figure), and the multiplexing unit storing 
so the last data from the PES packet (MU #N). 

[0288] The resulting packs of the TS2PS converted MP EG- PS are as shown in Fig. 40B according to these multiplexing 
unit types. 

[0289] The pack converted from MU #1 Is always generated with at least 1 0 bytes of empty space, and therefore has 
a padding packet added to the end. 
55 [0290] If a space of 7 bytes or less is left in a pack in the DVD format, stuffing bytes (the last field in the packet header) 
are added to a total of 2048 bytes. If 8 or more bytes are empty, a padding packet is added. 

[0291] Packs converted from MU #n have one stuffing byte added to complete the pack. The pack converted from 
MU #N has a padding packet added because at least 8 bytes are usually empty when the pack is compiled. 
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<Conversion to audio packs A_PCK> 

[0292] Figs. 41 A and 41 B show conversion of a Constrained SESF to MPEG-PS. As shown in the figure one audio 
PES packet (storing one or more audio frames) is smaller than one multiplexing unit. 
5 [0293] Because one audio PES packet will fit into one multiplexing unit, complicated conversion such as used for 
video PES packet conversion is not needed. That is, as shown in Fig.41 B, a padding packet is always inserted to the 
generated packs. 

[0294] Furthermore, because the PES _packetjength does not change during TS2PS conversion, the only calculation 
needed for conversion is to appropriately set the stream_id during MPEG-1 Audio conversion. 
10 [0295] Fig. 42 shows the audio bit rate allowable in a Constrained SESF and the maximum payload stored to one 
audio PES packet when AC-3 and MPEG-1 Audio are stored. A padding packet is always inserted because audio data 
exceeding the maximum byte lengths shown here will not be stored to one audio PES packet 

<TS2PS conversion process> 

15 

[0296] The TS2PS conversion process is described in detail with reference to the flow charts in Fig. 43 to Fig. 54. 
[0297] Fig. 43 is a flow chart of the main TS2PS conversion process. This process starts in response to a TS2PS 
conversion request from the user. The first step is to seek the first SESF capsule where conversion starts (S1 1 ). Whether 
the SESF capsule to process is found is then determined (S1 2). If it is not, the process ends. If it is found an initialization 

20 process (S13) and capsule unit process (S14) run. 

[0298] Fig. 44 is a flow chart of the initialization process (S13). This process sets and initializes the variables used in 
the subsequent process, and starts by determining if a Tip packet was read (S21). If the Tip packet has not been read, 
the Tip packet is read (S22). The ATS value of the Tip packet is then written to variable ATSTip (S23), the PCR value 
of the Tip packet is written to variable PCRTip (S24), variable MLLnum defining the number of the multiplexing unit 

25 being processed is initialized to 0 (S25), and variable WA denoting the number of ATS overflows is initialized to 0 (S26). 
[0299] Fig. 45 is a flow chart of the capsule unit process (S1 4). This process likewise starts by reading one TS packet 
(S31 ). Whether the read TS packet is a Tip packet is then detected (S32). If the read packet is a Tip packet, processing 
ends. If it is not a Tip packet, whether the read TS packet contains an audio packet or video packet is detected (S33). 
If the TS packet contains neither an audio packet or video packet, control loops back to step S31, and TS packets are 

30 sequentially read until a TS packet containing an audio or video packet is detected (S31 to S33). If theTS packet contains 
an audio or video packet, the next 10 TS packets are also read (S34). MU_num is then incremented (S35). The ATS 
value from the first TS packet in the multiplexing unit is then written to variable ATS[MU_num] (S36). The byte length 
of the payload data in the PES packet of the multiplexing unit is set to payloadjen (S37). A pack unit process then runs 
(S38). 

35 [0300] Fig. 46 is a flow chart of the pack unit process (S38). This pack unit process consists of four subroutines: SCR 
calculation (S41 ), pack header process (S42), packet header process (S43), payload process (S44), and padding packet 
process (S45). Each of these subroutines is described below. 

[0301] The SCR calculation process is shown in Fig. 47. This process determines the pack SCR value. - 

[0302] Variable MU_num is first referenced to detect the first multiplexing unit in the capsule. If it is the first multiplexing 

40 unit the value of ATSTip is written to variable ATS[0], and PCRTip is written to variable SCR[0] (steps S51 to S53). 
[0303] ATS[MU_num] and ATS[MU_num-1]] are then compared (S55). The ATS value from the first packet in the 
multiplexing unit is stored to ATSp]. The ATS value denotes the relative transfer timing referenced to a given packet. 
The ATS value in a later packet is therefore normally higher than the ATS value of a preceding packet. However, because 
the ATS is a finite value definable in 30 bits, carry (column overflow) can occur. In this case the ATS value of a later 

45 packet could be smallerthan the ATS of a preceding packet. Step S54 monitors this reversal of ATS values and thereby 
determines if a carry (column overflow) occurred. If ATS[MU_num] is less than or equal to ATS[MU_num-1], that is, if 
the overflow is detected, variable WA is incremented (S55). 

[0304] The greater of SCR[MU_num-1]+T and (PCRTip + ATS[MtL_num] - ATSTip + WA x BS) is then substituted for 
SCR[MU_num] (S56). 

so [0305] The pack header process is described next with reference to Fig. 48. 

[0306] This process edits the pack header data with the data structure shown in Fig. 38. The remainder of SC R divided 
by 300 is written to SCR_extension (S61 ) and the quotient is written to SCR J>ase (S62), "0x6270" is written to program. 
mux_rate (S63), and "000b" is written to pack_stuffing_length (S64). Other field values are then edited appropriately to 
complete the pack header data (S65). 

55 [0307] The packet header process is described with reference to Fig. 49. 

[0308] This process starts with a stream ID routine to set the stream ID (S71). Whether the first TS packet in the 
multiplexing unit contains a PES packet header is then detected (S72). If the first TS packet in the multiplexing unit 
contains a PES packet header, a start-of-PES packet process runs (S73), and otherwise a non-start-of-PES packet 
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process runs (S74). Whether the first TS packet in a multiplexing unit contains a PES packet header can be determined 
by referencing the payload_unit_start_jndicator in the TS packet header or by directly detecting if the PES packet header 
start code is stored. 

[0309] The stream ID process is described next with reference to Fig. 50. 
5 [0310] This process sets the streamJD field value. If the type of stream being processed is "MPEG-2 Video", the 

streamjd is set to "OxEO" (S81, S82). If the stream type is "AC3-Audio", stream Jd is set to "OxBD" (S83, S84). If the 

stream type is "MPEG-1 Audio" and "Primary audio", streamjd is set to "OxCO" (S85, S86, S87). If the stream type is 

"MPEG-1 Audio" and "Secondary audio", streamjd is set to "0xC1" (S85, S88, S89). 

[031 1] The start-of-PES packet process is described with reference to Fig. 51 . 
10 [0312] Fig. 56 shows the PES packet structure of the MPEG standard in detail. This start-of-PES packet process edits 

the field values according to the structure shown in Fig. 56. 

[0313] If the stream type is "MPEG-2 Video" is detected first (S91). If it is, the value calculated from the following 
equation is written to PES_packetJength (S92). 

PES_packet_length = (3 + PES_header_data_length) + payloadjen 

[031 4] The 3 bytes from "1 0" to PES Jieader_datajength (see Fig. 56) in each field of the TS packet before conversion 
are copied directly to the corresponding field of the packet header of the converted MPEG-PS pack (S93). PTS_DTS_ 

20 flags in the TS packet before conversion is referenced to detect if a PTS is present (S94). If a PTS is present, it is copied 
directly to the corresponding field of the packet header in the converted MPEG-PS pack (S95). PTS_DTS_flags is 
likewise referenced to detect if a DTS is present (S96). If a DTS is present, it is copied directly to the corresponding field 
of the packet header in the converted MPEG-PS pack (S97). If the PES_extension Jlag is set to "1" (S98), step S99 runs. 
[0315] In step S99 the stream type is again detected, and the three bytes from PES_private_dataJlag to P_STD_ 

25 bufferjlag are overwritten according to the detected stream type. That is, if the stream type is "MPEG-2 Video" (S99), 
the three bytes from PES_private_data_flag to P_STD_buff er Jlag are overwritten with "0x1 E60E8" (S1 00). If the stream 
type is "AC3-Audio" (S1 01 ), "0x1 E603A" is written (S1 02). If the stream type is "MPEG-1 Audio" (S1 03), "0x1 E4020" is 
written (S104). 

[0316] The non-start-of-PES packet process is described next with reference to Fig. 52. 

30 [031 7] The 2 bytes from " 1 0° to PES_extension Jlag in the PES packet are set to "0x8000" (S1 11), and whether the 
payloadjen is less than 201 8 is detected (S1 1 2). The payloadjen is the data length of the PES packet in one multiplexing 
unit and is a maximum 184 x 1 1 = 2024 bytes. If payloadjen is less than 2018, PESJieader_dataJength is set to 0 
(S1 13). If payloadjen is greater than or equal to 2018, PESJieader_dataJength is set to (2025 - payloadjen) (S1 14) 
andthe PES packet is stuffed the byte length of PES Jieader_datajength (S1 15). The value calculatedfrom the following 

35 equation is set to PES_packetJength (S11 6). 



PES_packetJength = (3 + PESJieader_dataJength) + payloadjen 

[0318] The payload process is described next with reference to Fig. 53. 

[031 9] Variable i is first initialized to 1 (S1 21 ). The payload data of the PES packet stored to the i-th TS packet is then 
read (122) and added to the payload data of the pack (S123). Variable i is then incremented (S124). Steps S122 to S125 
repeat until i = 12 (S125), that is, until all TS packets in one multiplexing unit are processed. 
[0320] The padding packet process is described next with reference to Fig. 54. 

[0321] Whether PES_packetJength equals 2028 is first detected (S131 ). If PES _packetjength does not equal 2028, 
PES_packetJength of the padding packet is set to {(2028 - PES_packetJength) - 6} (S132), and the padding packet 
is added to the payload (S133). 



Claims 

1. A recording apparatus for recording multiplexed video information and audio information to a recording medium 
(100) as a first stream in a constrained format being convertible from the first stream to a second stream, 
the recording apparatus comprising: 

an encoding section (214) operable to encode the video information and audio information to be recorded to 
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the first stream according to the constrained format; 

a recording section (221) operable to record the encoded first stream to the recording medium; and 
a control section (212) operable to control the encoding section and recording section, 
the first stream having a structure for storing data segmented in first blocks, the second stream having a structure 
5 for storing data segmented in second blocks, the maximum data size of the first and second blocks being different, 

characterized in 

that with the constrained format a specific number of consecutive first blocks in the first stream are managed as a 
unit (Multiplexing Unit) (210), wherein the specific number is set so that a total amount of data to be stored in the 
10 unit does not exceed an amount of data to be stored in one second block, and all data to be stored in the same unit 

are of the same type of elementary stream, allowing a conversion of the first stream in the constrained format to the 
second stream by converting, by unit, the first blocks composing one unit to one second block without changing the 
multiplexing order of the first blocks, and 

that the control section (21 2) is adapted for determining time stamps to be added to the first blocks of the encoded 
*s first stream, 

wherein a plurality of consecutive units (21 0) in the first stream are managed as a capsule (200) in which a control 
block (tip) is inserted; 

wherein the first block located at the head of the unit (210) includes first time stamp information (ATSp]) indicating 
an input start time of said first block to a system decoder (21 8) based on a first reference value; 
20 wherein the control block (tip) contains first tip time stamp information (ATSjip) indicating an input start time of said 

control block provided to a system decoder (218) based on the first reference value, and second tip time stamp 
information (PCR_tip) indicating an input start time of said control block provided to a system decoder (21 8) based 
on a second reference value different from the first reference value; and 
wherein there is further provided: 

25 

means for calculating the second time stamp information calculated_PCR[i] of the first block located at the head 
of each unit and an input start time SCR[i], provided to the system decoder, of each second block included in 
the second stream converted from the first stream which are obtained from the following formulas, 
SCR[1 ]=calculated_PCR[1 ] 
30 SCRp]=max(SCRp-1]+T, calculated_PCRp]) (i =2, 3, ...) 

calculated_PCR[i]=PCRJip+(ATS[i]-ATSJip+C) 

where T is the minimum transfer time of a second block, and C is a correction factor for overflow of ATSp], 
means for determining if buffer underflow occurs in the encoded first stream in the constrained format or the 
second stream, the determination for the second stream being based on said calculation, and 
35 means for adapting the first time stamp information to be added to first blocks located at the head of the units 

such that buffer underflow does not occur in the encoded first stream and the second stream. 

2. The recording apparatus as described in claim 1 , 

wherein transfer rate conditions allowed for audio data and video data are different in the first stream and second 
40 stream, and 

the control section (212) controls the encoding section (214) such that the first stream meets these transfer rate 
conditions in both the encoded first stream and anticipated second stream. 

3. The recording apparatus as described in claim 2, 
45 wherein according to these transfer rate conditions: 

the maximum transfer rate allowed for a first block storing video data in the first stream is greater than or equal 
to the maximum transfer rate allowed for a first block storing audio data; and 

the maximum transfer rate allowed for a second block storing video data in the second stream is equal to the 
so maximum transfer rate allowed for a second block storing audio data. 

4. A recording method for recording multiplexed video information and audio information to a recording medium (100) 
as a first stream in a constrained format being convertible from the first stream to a second stream, 

the recording method comprising the steps of: 

55 

encoding the video information and audio information to be recorded to the first stream according to the con- 
strained format; and 

recording the encoded first stream to the recording medium, 
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the first stream having a structure for storing data segmented in first blocks, the second stream having a structure 
for storing data segmented in second blocks, the maximum data size of the first and second blocks being different, 

characterized In 

5 that with the constrained format a specific number of consecutive first blocks in the first stream are managed as a 

unit (Multiplexing Unit) (210), wherein the specific number is set so that a total amount of data to be stored in the 
unit does not exceed an amount of data to be stored in one second block, and all data to be stored in the same unit 
are of the same type of elementary stream, allowing a conversion of the first stream in the constrained format to the 
second stream by converting, by unit, the first blocks composing one unit to one second block without changing the 

10 multiplexing order of the first blocks, and 

that the method further comprises the step of determining time stamps to be added to the first blocks of the encoded 
first stream, 

wherein a plurality of consecutive units (21 0) in the first stream are managed as a capsule (200) in which a control 
block (tip) is inserted; 

15 wherein the first block located at the head of the unit (21 0) includes first time stamp information (ATSp]) indicating 

an input start time of said first block to a system decoder (218) based on a first reference value; 
wherein the control block (tip) contains first tip time stamp information (ATS_tip) indicating an input start time of said 
control block provided to a system decoder (218) based on the first reference value, and second tip time stamp 
information (PCR Jip) indicating an input start time of said control block provided to a system decoder (21 8) based 

20 on a second reference value different from the first reference value; and 

wherein the method further comprises the steps of: 

calculating the second time stamp information calculated_PCR[i] of the first block located at the head of each 
unit and an input start time SCRp], provided to the system decoder, of each second block included in the second 
25 stream converted from the first stream which are obtained from the following formulas, 

SCR[1 ]=calculated_PCR[1 ] 

SCR[i]=max(SCRp-1]+T, calculated_PCR[i]) (i =2, 3, ...) 
calculated_PCR[i]=PCR_tip+(ATSn]-ATS_tip+C) 

where T is the minimum transfer time of a second block, and C is a correction factor for overflow of ATSp], 
30 determining if buffer underflow occurs in the encoded first stream in the constrained format or the second stream, 

the determination for the second stream being based on said calculation, and 

adapting the first time stamp information to be added to first blocks located at the head of the units such that 
buffer underflow does not occur in the encoded first stream and the second stream. 

35 5. A computer-readable program for executing the method as described in claim 4 on a computer. 



Patentanspruche 

40 1, Aufzeichnungsvorrichtung zum Aufzeichnen von Multiplex- Video- und -Audiodaten auf einem Datentrager (1 00) in 
Form eines ersten Stroms in einem Komprimierungsformat, welche aus dem ersten Strom in einen zweiten Strom 
konvertierbar sind, 

wobei die Aufzeichnungsvorrichtung aufweist: 

45 eine Codierungseinheft (214) zum Codieren der als erster Strom aufzuzeichnenden Video- und Audiodaten 

gemaB dem Komprimierungsformat, 

eine Aufzeichnungseinheit (221) zum Aufzeichnen des codierten ersten Stroms auf dem Datentrager und 
eine Steuereinheit (212) zum Steuem der Codierungseinheit und der Aufzeichnungseinheit, 

50 wobei der erste Strom so strukturiert ist, dass Daten segmentiert in ersten BIScken gespeichert werden, und wobei 

der zweite Strom so strukturiert 1st, dass Daten segmentiert in zweiten BlOcken gespeichert werden, wobei der 
maximale Datenumfang der ersten und der zweiten Bl&cke unterschiedlich ist, 

dadurch gekennzelchnet, dass bei dem Komprimierungsformat eine bestimmte Anzahl aufeinanderfolgender 
erster Blocke in dem ersten Strom als Einheit (Multiplexeinheit) (210) behandelt werden, wobei die bestimmte Anzahl 
55 so festgelegt ist, dass die Gesamtmenge der in der Einheit zu speichemden Daten die in einem zweiten Block zu 

speichernde Datenmenge nicht ubersteigt, und alle in derselben Einheit zu speichemden Daten gleichartige Daten 
des elementaren Stroms sind, so dass eine Konvertierung des ersten Stroms im Komprimierungsformat in den 
zweiten Strom mfiglich wird, indem die eine Einheit bildenden ersten Bldcke einheltsweise in einen zweiten Block 
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konvertiert werden, ohne die Multiplexreihenfolge der ersten BI6cke zu Sndern, und 

dass die Steuereinheit (212) dazu eingerichtet 1st, Zeitmarken festzulegen, die den ersten Bldcken des codierten 
ersten Stroms anzufugen sind, 

wobei mehrere aufeinanderfolgende Einheiten (210) in dem ersten Strom als Kapsel (200) behandelt werden, in 
die ein Steuerblock (tip) eingefugt ist, 

wobei der erste am Anfang der Einheit (210) angeordnete Block eine erste Zeitmarke (ATS[i]) enthStt, die den 
Zeitpunkt des Beginns der Eingabe dieses ersten Blocks in einen Systemdecoder (218) auf der Basis eines ersten 
Referenzwerts angibt, 

wobei der Steuerblock (tip) eine erste Tip-Zeitmarke (ATSjip), welche den Zeitpunkt des Beginns der Eingabe des 
Steuerblocks in einen Systemdecoder (218) auf der Basis des ersten Referenzwerts angibt, und eine zweite Tip- 
Zeitmarke (PCRjip), welche den Zeitpunkt des Beginns der Eingabe des Steuerblocks in einen Systemdecoder 
(218) auf der Basis eines zweiten, vom ersten verschiedenen Referenzwerts angibt, enth&lt, und 
wobei auBerdem vorhanden sind: 

Mittel zum Berechnen der zweiten Zeitmarke calculated_PCR[i] des ersten Blocks am Anfang jeder Einheit und 
des Zeitpunkts des Beginns der Eingabe SCRp] in den Systemdecoder jedes zweiten Blocks in dem aus dem 
ersten Strom konvertierten zweiten Strom, die nach folgenden Formeln erhalten werden: 
SC R[1 ]=calcu lated_PC R[ 1 ] 

SCR[i]=max(SCR[i-1]+T, calculated. PC R[i] (i=2, 3, ...) 
calculated.PCR[i]=PCR_tip+(ATS[i]-ATS_tip+C) 

wobei T die MindestObertragungsdauer eines zweiten Blocks und C ein Korrekturfaktor fur den Oberlauf von 
ATSp] ist, 

Mittel zum Bestimmen, ob beim codierten ersten Strom im Komprimierungsformat Oder im zweiten Strom ein 
Leerlaufen des Zwischenspeichers auftritt, wobei die Bestimmung beim zweiten Strom auf der Basis obiger 
Berechnung erfolgt, und 

Mittel zum Anpassen der ersten, den ersten Biocken am Anfang der Einheiten anzufiigenden Zeitmarke in der 
Weise, dass im kodierten ersten Strom und im zweiten Strom kein Leerlaufen des Zwischenspeichers auftritt. 

Aufzeichnungsvorrichtung nach Anspruch 1, 

bei der Bedingungen der Ubertragungsgeschwindigkeit fur Audiodaten und Videodaten im ersten Strom und im 
zweiten Strom verschieden sind, und 

die Steuereinheit (212) die Codierungseinheit (214) so steuert, dass der erste Strom diese Bedingungen der Uber- 
tragungsgeschwindigkeit sowohl beim codierten ersten Strom als auch beim vorweggenommenen zweiten Strom 
erfullt. 

Aufzeichnungsvorrichtung nach Anspruch 2, 

bei der nach diesen Bedingungen der Ubertragungsgeschwindigkeit: 

die fur einen ersten Block mit Videodaten im ersten Strom maximal zuiassige Ubertragungsgeschwindigkeit 
groBer als die fur einen ersten Block mit Audiodaten maximal zulSssige Ubertragungsgeschwindigkeit Oder 
gleich groB ist; und 

die fur einen zweiten Block mit Videodaten im zweiten Strom maximal zulassige Ubertragungsgeschwindigkeit 
gleich der fur einen zweiten Block mit Audiodaten maximal zulSssigen Ubertragungsgeschwindigkeit ist. 

Aufzeichnungsverfahren zum Aufzeichnen von Multiplex-Video- und -Audiodaten auf einem DatentrSger (100) in 
Form eines ersten Stroms in einem Komprimierungsformat, welche aus dem ersten Strom in einen zweiten Strom 
konvertierbar sind, 

wobei das Aufzeichnungsverfahren folgende Schritte aufweist: 

Codieren der als erster Strom aufzuzeichnenden Video- und Audiodaten gemSB dem Komprimierungsformat; 
und 

Aufzeichnen des codierten ersten Stroms auf dem DatentrSger, 

wobei der erste Strom so strukturiert ist, dass Daten segmentiert in ersten Bldcken gespeichert werden, und der 
zweite Strom so strukturiert ist, dass Daten segmentiert in zweiten Blttcken gespeichert werden, wobei der maximale 
Datenumfang der ersten und der zweiten Bldcke unterschiedlich ist, 

dadurch gekennzelchnet, dass bei dem Komprimierungsformat eine bestimmte Anzahl aufeinanderfolgender 
erster BI6cke in dem ersten Strom als Einheit (Multiplexeinheit) (210) behandelt werden, wobei die bestimmte Anzahl 



27 



EP1 391 119 B1 



so festgelegt ist, dass die Gesamtmenge der in der Einheit zu speichernden Daten die in einem zweiten Block zu 
speichernde Datenmenge nicht Gbersteigt, und alle in derselben Einheit zu speichernden Daten gleichartige Daten 
des elementaren Stroms sind, so dass eine Konvertierung des ersten Stroms im Komprimierungsformat in den 
zweiten Strom m&glich wird, indem die eine Einheit bildenden ersten Blocke einheitsweise in einen zweiten Block 
konvertiert werden, ohne die Multiplexreihenfolge der ersten Blficke zu andern, und 

dass das Verfahren auBerdem den Schritt der Festlegung von Zeltmarken aufweist, die den ersten Blbcken des 
codierten ersten Stroms anzufugen sind, 

wobei mehrere aufeinanderfolgende Einheiten (210) im ersten Strom als Kapsel (200) behandelt werden, in die ein 
Steuerblock (tip) eingefQgt ist; 

wobei der erste am Anfang der Einheit (210) angeordnete Block eine erste Zeitmarke (ATS[i]) enthatt, die den 
Zeitpunkt des Beginns der Eingabe dieses ersten Blocks in einen Systemdecoder (218) auf der Basis eines ersten 
Referenzwerts angibt; 

wobei der Steuerblock (tip) eine erste Tip-Zeitmarke (ATS_tip) p welche den Zeitpunkt des Beginns der Eingabe des 
Steuerblocks in einen Systemdecoder (218) auf der Basis des ersten Referenzwerts angibt, und eine zweite Tip- 
Zeitmarke (PCRJip), welche den Zeitpunkt des Beginns der Eingabe des Steuerblocks in einen Systemdecoder 
(218) auf der Basis eines vom ersten Referenzwert verschiedenen zweiten Referenzwerts angibt, enthalt; und 
wobei das Verfahren auf3erdem folgende Schritte aufweist 

Berechnen der zweiten Zeitmarke calculated_PCR[i] des ersten Blocks am Anfang jeder Einheit und des Zeitpunkts 
des Beginns der Eingabe SCR[i] in den Systemdecoder jedes zweiten Blocks in dem aus dem ersten Strom kon- 
vertierten zweiten Strom, die nach folgenden Formeln erhalten werden: 
SC R[1 ]=calcu lated_PCR[1] 

SCR[i]=max(SCR[i-1]+T, calculated_PCRp] (i=2, 3, ...) 
calculated_PCRp]=PCR_tip+(ATSp]-ATSJip+C) 

wobei T die Mindestubertragungsdauer eines zweiten Blocks und C ein Korrekturfaktor fur ein Uberlaufen von ATS 
P) ist, 

Bestimmen, ob beim codierten ersten Strom im Komprimierungsformat Oder im zweiten Strom ein Leerlaufen des 
Zwischenspeichers auftritt, wobei die Bestimmung beim zweiten Strom auf der Basis obiger Berechnung erfolgt, und 
Anpassen der ersten, den ersten Blocken am Anfang der Einheiten anzufiigenden Zeitmarke in der Weise, dass im 
kodierten ersten Strom und im zweiten Strom kein Leerlaufen des Zwischenspeichers auftritt. 

5. Computerlesbares Programm zum Ausfiihren des in Anspruch 4 beschriebenen Verfahrens auf einem Computer. 
Revendlcations 

1 . Appareil d'enregistrement pour enregistrer des informations video multiplexers et des informations audio multiple- 
xers sur un support d'enregistrement (1 00) comme etant un premier f lot selon un format contraint qui sont conver- 
tibles du premier flot en un second flot, 
I'appareil d'enregistrement comprenant : 

une partie de codage (214) utilisable pour coder les informations video et les informations audio qui doivent 
etre enregistrees pour le premier flot selon le format contraint ; 

une partie d'enregistrement (221) utilisable pour enregistrer le premier flot code sur le support d'enregistrement ; 
et 

une partie de commande (212) utilisable pour commander la partie de codage et la partie d'enregistrement, 
le premier flot ayant une structure pour stocker les donnees segmentees dans les premiers blocs, le second 
flot ayant une structure pour stocker les donnees segmentees dans les seconds blocs, la taille de donnees 
maximale des premiers et seconds blocs etant diff6rente, caracterise en ce 

qu'avec le format contraint, un nombre specifique de premiers blocs consecutifs dans le premier flot sont g6res 
comme etant un element (element de multiplexage) (210), dans lequel le nombre specifique est fix6 de telle 
sorte qu'une quantite totale de donnees qui doivent §tre stockees dans I'eiement ne depasse pas une quantite 
de donnees qui doivent §tre stockees dans un second bloc et que toutes les donnees qui doivent £tre stockees 
dans le meme element sont du meme type que le flot 6lementaire, permettant une conversion du premier flot 
dans le format contraint en un second flot en convertissant, par element, les premiers blocs se composant d'un 
element en un second bloc sans changer Tordre de multiplexage des premiers blocs, et 
que la partie de commande (21 2) est adaptee pour determiner les marques temporelles qui doivent §tre ajoutees 
aux premiers blocs du premier flot code, 
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dans lequel une plurality d'eiements cons6cutifs (210) dans le premier flot sont g£r£s comme une capsule (200) 
dans laquelle un bloc de commande ("tip") est insere ; 

dans lequel le premier bloc situe & la tSte de P6l6ment (21 0) inclut les premieres informations de marque temporelle 
(ATS[i]) indiquant un temps de debut d'entree dudit premier bloc a. un decodeur de systeme (21 8) sur la base d'une 
premiere valeur de reference ; 

dans lequel le bloc de commande ("tip'') contient les premieres informations de marque temporelle (ATSjip) indiquant 
un temps de debut d'entree dudit bloc de commande pr6vu & un decodeur de systeme (218) sur la base de la 
premiere valeur de reference, et les secondes informations de marque temporelle (PCRJip) indiquant un temps 
de debut d'entree dudit bloc de commande pr6vu k un decodeur de systeme (21 8) sur la base d'une seconde valeur 
de reference differente de la premiere valeur de reference ; et 
dans lequel il est, en outre, pr6vu : 

un moyen pour calculer les secondes informations de marque temporelle "calculated_PCRp]" du premier bloc 
& la tete de chaque element et un temps de debut d'entree SCR[i], pr6vu au decodeur de systeme (218), de 
chaque second bloc inclus dans le second flot converti du premier flot qui sont obtenues & partir des formules 
suivantes, 

SCR(1) = calculated_PCR[1] 

SCRp] = max (SCR [i-1] +T, calculated_PCR[i]) (i = 2, 3, ... ) 
calculated_PCR[i] = PCRJip + (ATS[i] - ATSJip + C) 

ou T est le temps de transfert minimal d' un second bloc et C est un facteur de correction pour le debordement 
du ATS[i], 

un moyen pour determiner si un sous-d6passement de la capacite de la memoire tampon se produit dans le 
premier flot code selon le format contraint ou dans le second flot, la determination pour le second flot etant 
bas£e sur ledit calcul, et 

un moyen pour adapter les premieres informations de marque temporelle qui doivent etre ajoutees aux premiers 
blocs situ6s a la tete des elements de telle sorte que le sous-depassement de la capacite de la memoire tampon 
ne se produit pas dans le premier flot code et dans le second flot. 

Appareil d'enregistrement comme decrit dans la revendication 1 , 

dans lequel les conditions de Vitesse de transfert permises pour les donnees audio et pour les donnees video sont 
diff6rentes dans le premier flot et dans le second flot, et 

la partie de commande (212) commande la partie de codage (214) de telle sorte que le premier flot satisfa'rt & ces 
conditions de vitesse de transfert & la fois dans le premier flot code et dans le second flot prevu. 

Appareil d'enregistrement comme decrit dans la revendication 2, 
dans lequel, selon ces conditions de vitesse de transfert : 

la vitesse de transfert maximale permise pour un premier bloc stockant les donnees video dans le premier flot 
est plus grande que ou egale a la vitesse de transfert maximale permise pour une premier bloc stockant les 
donnees audio ; et 

la vitesse de transfert maximale permise pour un second bloc stockant les donnees video dans le second flot 
est 6gale a. la vitesse de transfert maximale permise pour une second bloc stockant les donnees audio. 

Precede d'enregistrement pour enregistrer des informations video multiplexees et des informations audio multiple- 
xees sur un support d'enregistrement (1 00) comme etant un premier flot selon un format contraint qui sont conver- 
tibles du premier flot en un second flot, 

le procede d'enregistrement comprenant les etapes consistant & : 

coder les informations video et les informations audio qui doivent etre enregistr6es pour le premier flot selon 
le format contraint ; et 

enregistrer le premier flot code sur le support d'enregistrement ; 

le premier flot ayant une structure pour stocker les donnees segmentees dans les premiers blocs, le second 
flot ayant une structure pour stocker les donnees segmentees dans les seconds blocs, la taille de donnees 
maximale des premiers et seconds blocs etant differente, caracterise en ce 

qu'avec le format contraint, un nombre specifique de premiers blocs cons6cutifs dans le premier flot sont g6r6s 
comme etant un element (element de multiplexage) (210), 

dans lequel le nombre specifique estfixe de telle sorte qu'une quantite totale de donnees qui doivent etre stockees 
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dans I'element ne depasse pas une quantity de donnees qui doivent etre stockees dans un second bloc et que 
toutes les donn6es qui doivent etre stock6es dans le meme element sont du meme type que le flot elementaire, 
permettant une conversion du premier flot dans le format contraint en un second flot en convertissant, par element, 
les premiers blocs se composant d'un Element en un second bloc sans changer I'ordre de multiplexage des premiers 
blocs, et 

que le precede comprend, en outre, Tetape consistant a determiner des marques temporelles qui doivent etre 
ajoutees aux premiers blocs du premier flot cod6, 

dans lequel une plurality d'etements consecutifs (210) dans le premier flot sont ge>6s comme une capsule (200) 
dans laquelle un bloc de commande ("tip") est insere ; 

dans lequel le premier bloc situe a la tete de I'element (21 0) inclut les premieres informations de marque temporelle 
(ATS[i]) indiquant un temps de d§but d'entree dudit premier bloc a un decodeur de systeme (218) sur la base d'une 
premiere valeur de reference ; 

dans lequei le bloc de commande ("tip") contient les premieres informations de marque temporelle (ATSJip) indiquant 
un temps de debut d'entree dudit bloc de commande prevu a un decodeur de systeme.(21 8) sur la base de la 
premiere valeur de reference, et les secondes informations de marque temporelle (PCR_tip) indiquant un temps 
de debut d'entree dudit bloc de commande prevu a un decodeur de systeme (21 8) sur la base d'une seconde valeur 
de reference differente de la premiere valeur de reference ; et 
dans lequel le precede comprend, en outre, les etapes consistant a : 

calculer les secondes informations de marque temporelle B calculated_PCR[i] ,, du premier bloc a la tete de 
chaque element et un temps de debut d'entree SCR[i], prevu au decodeur de systeme (21 8) de chaque second 
bloc inclus dans le second flot converti du premier flot qui sont obtenues a partir des formules suivantes, 
SCR(1) = calculated_PCR[1] 

SCR[i] = max(SCR[i-1]+T, calcu1ated_PCRp]) (i = 2, 3, ...) 
calculated_PCR[i] = PCRJip + (ATS[i] - ATSJip + C) 

ou T est le temps de transfert minimal d'un second bloc et C est un facteur de correction pour le debordement 
du ATSp], 

determiner si un sous-depassement de la capacity de la memoire tampon se produit dans le premier flot code 
selon le format contraint ou dans le second flot, la determination pour le second flot etant basee sur ledit calcul, et 
adapter les premieres informations de marque temporelle qui doivent etre ajoutees aux premiers blocs situes 
a la tete des elements de telle sorte que le sous-depassement de la capacite de la memoire tampon ne se 
produit pas dans le premier flot code et dans le second flot. 

Programme lisible par un ordinateurpour executer le precede comme decrit dans la revendication 4 sur un ordinateur. 
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